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PATENTS 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
re application of 
Anthony Keith CAMPBELL 

Serial No. 09/831,142 Box PCT 

(PCT/GB99/03654) Attention: DO/EO 

Filed May 7,2001 

PROTEIN AND DNA CODING THEREFOR 

AMENDMENT 

Commissioner for Patents 

Washington, D.C. 20231 

Sir: 

Responsive to the Notification of Missing Requirements 
mailed September 20, 2001, please amend the above-identified 
application as follows : 

IN THE SPECIFICATION : 

Kindly replace the specification originally filed with 
the attached substitute specification, provided in clean and red- 
lined versions. 

IN THE CLAIMS : 

Amend claim 2 as follows : 

2. (amended) A sequence according to claim 1, wherein 
the sequence that encodes for apopholasin is as shown in Figure 
4B (SEQ ID NO: 1) . 



CAMPBELL S.N. 09/831,142 



Amend claim 3 as follows: 

3. (amended) A sequence according to claim 1, wherein 
the sequence that encodes for apopholasin is as shown in any one 
of Figures 1, 2, 3, 4A, 6 or 9 (encompassing SEQ ID NOS 1-6 and 
23) . 

Amend claim 12 as follows: 

12. (amended) An isolated, purified or recombinant 
polypeptide according to claim 11 comprising the amino acid 
sequence of Figure 4 or Figure 5 (SEQ ID NOS 1 and 4-6) . 

Kindly make of record the attached substitute Sequence 
Listing, submitted in paper and disc formats. 

REMARKS 

The Notification of Missing Requirements mailed 
September 20, 2001, called for submission of a substitute 
Sequence Listing. A suitable substitute Sequence Listing has 
been prepared and is submitted herewith, in paper and disc 
formats . 

Applicant hereby states that the content of the paper 
and disc versions of the Sequence Listing is the same, and that 
these introduce no new matter into the present application. 

Additionally, the specification has been revised to 
introduce the sequence identification numbers at appropriate 
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locations, pursuant to the Sequence Listing requirements. As 
there are 42 sequences in the present application, the nature of 
the amendments to the specification were sufficiently extensive 
that it was considered expedient to prepare a substitute 
specification. Consequently, also attached to the present 
amendment is a substitute specification, both in clean form and 
in "red-line" form, the latter showing the nature of the 
amendments to the specification relative to the original 

specification . 

It is believed that this paper complies with the 
requirements set forth in the Notification of Missing 
Requirements mailed September 20, 2001, such that the application 
is now in condition for examination on the merits. Such action 
is respectfully requested. 

Attached hereto is a marked-up version of the changes 
made to the claims. The attached page is captioned "VERSION WITH 
MARKINGS TO SHOW CHANGES MADE." 

Respectfully submitted, 




Andrew J. Pat-ch 
Attorney for Applicant 
Registration No. 32,925 
745 South 23rd Street 
Arlington, VA 22202 
Telephone: 521-2297 

November 13, 2001 
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VERSION WITH MARKINGS TO SHOW CHANGES MADE 



IN THE CLAIMS : 

Claim 2 has been amended as follows: 

2. (amended) A sequence according to claim 1, wherein 
the sequence that encodes for apopholasin is as shown in Figure 
4B (SEQ ID NO: 1) . 



Claim 3 has been amended as follows : 

3. (amended) A sequence according to claim 1, wherein 
the sequence that encodes for apopholasin is as shown in any one 
of Figures 1, 2, 3, 4A, 6 or 9 (encompassing SEQ ID NOS 1-6 and 
23) . 



Claim 12 has been amended as follows: 

12. (amended) An isolated, purified or recombinant 
polypeptide according to claim 11 comprising the amino acid 
sequence of Figure 4 or Figure 5 (SEQ ID NOS 1 and 4-6) . 
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PATENTS 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
In re application of 
Anthony Keith CAMPBELL 
Serial No. (unknown) 
Filed herewith 

PROTEIN AND DNA CODING THEREFOR 

PRELIMINARY AMENDMENT 

Commissioner for Patents 

Washington, D.C. 20231 

Sir: 

Prior to calculation of the filing fee, please 
substitute Claims 1-30 as originally filed, with Claims 1-30 
as filed in the Article 34 amendment of December 7, 2000. The 
pages containing Claims 1-30 are marked "AMENDED SHEET" and 
are attached hereto. Following the insertion of Claims 1-30, 
please amend these claims as follows: 

IN THE CLAIMS : 

Amend claim 4 as follows: 

— 4. (Amended) A sequence according to claim 1, 
wherein the apopholasin is non-glycosylated . 

Amend claim 5 as follows: 

— 5. (Amended) A sequence according to claim 1, 
wherein the apopholasin is glycosylated. 

Amend claim 8 as follows: 

--8. (Amended) A construct according to claim 6, 
wherein the apophotoprotein is apopholasin. 



Anthony Keith CAMPBELL 
Amend claim 9 as follows: 

--9. (Amended) A recombinant construct according to 
claim 1, wherein the nucleic acid sequence is linked operably 
with nucleotides enabling expression and secretion of the 
apopholasin in a cellular host. 

Amend claim 10 as follows: 

— 10. (Amended) DNA or RNA according to claim 1. 
Amend claim 13 as follows: 

--13. (Amended) The apopholasin according to claim 
11 when expressed by recombinant DNA or RNA according to claim 
10. 

Amend claim 15 as follows: 

--15. (Amended) A cell, plasmid, virus or live 
organism having incorporated expressibly therein a sequence 
according to claim 1, whereby it is capable of producing an 
apoprotein . 

Amend claim 16 as follows: 

--16. (Amended) A vector comprising a sequence 
according to claim 1. 

Amend claim 18 as follows: 

--18. (Amended) A bioluminescent oxidative indica- 
tor protein (BOIP) r comprising an apophotoprotein according to 
claim 11 in association with a luciferin. 

Amend claim 23 as follows: 

— 23. (Amended) A method according to claim 21, 
wherein said BOIP is selected from native or chemically-or 
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Anthony Keith CAMPBELL 
genetically-modified BOIP or a 'rainbow protein' based on such 
a BOIP. 

Amend claim 24 as follows: 

--24. (Amended) A method according to claim 21, 
wherein said BOIP includes a signal peptide, targeting it to 
a pre-determined extra-or intra-cellular site. 

Amend claim 25 as follows: 

--25. (Amended) A method according to claim 21, 
comprising incubating a test sample with a cell, plasmid, 
virus or live organism having incorporated expressibly 
therein : 

(a) a sequence that encodes the apophotoprotein of 
pholasin (alternatively, A apopholasin' ) ; 

(b) a sequence substantially homologous to or that 
hybridises to sequence (a) under stringent conditions; or 

(c) a sequence substantially homologous to or that 
hybridises under stringent conditions to the sequence (a) or 
(b) but for the degeneracy of the genetic code; or 

(d) an oligonucleotide specific for any of the 
sequences (a), (b) or (c) PROVIDED THAT such homologous 
sequences according to (b) or (c) encode a protein capable of 
binding to luciferin. 

Amend claim 26 as follows: 

— 26. (Amended) A method according to claim 21, 
wherein light emission takes place in the absence of a 
lucif erase . 

3 



Anthony Keith CAMPBELL 
Amend claim 27 as follows: 

--27. (Amended) The use of a sequence or a protein 
according to claim 1 in the detection, diagnosis or measure- 
ment of oxygen or a metabolite thereof. 

Amend claim 29 as follows: 

--29. (Amended) A method for obtaining a substan- 
tially homologous source of apopholasin, which method com- 
prises culturing cells having incorporated expressibly therein 
a polynucleotide encoding apopholasin as defined in claim 1, 
and thereafter recovering the cultured cells. 

Amend claim 30 as follows: 

— 30. (Amended) A method, use or kit according to 
claim 20, substantially as hereinbefore described with 
particular reference to the Examples. 

REMARKS 

The above changes in the claims merely place this 
national phase application in the same condition as it was 
during Chapter II of the international phase, with the 
multiple dependencies being removed. Following entry of this 
amendment by substitution of the pages, only claims 1-30 
remain pending in this application. 

Attached hereto is a marked-up version of the 
changes made to the claims by the current amendment. The 
attached page is captioned "VERSION WITH MARKINGS TO SHOW 
CHANGES MADE". 



\ ■ I 



Anthony Keith CAMPBELL 

Respectfully submitted, 
YOUNG & THOMPSON 
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Anthony Keith CAMPBELL 
VERSION WITH MARKINGS TO SHOW CHANGES MADE 

The claims have been amended as follows: 

4. (Amended) A sequence according to any preceding 
claim 1, wherein the apopholasin is non-glycosylated. 

5. (Amended) A sequence according to any preceding 
claim 1, wherein the apopholasin is glycosylated. 

8. (Amended) A construct according to claim 6— or 
claim 7 , wherein the apophotoprotein is apopholasin. 

9. (Amended) A recombinant construct according to 
any one of claims 1 to 0 , wherein the nucleic acid sequence is 
linked operably with nucleotides enabling expression and 
secretion of the apopholasin in a cellular host. 

10. (Amended) DNA or RNA according to any of claims 

1 to 9 . 

13. (Amended) The apopholasin according to claim 11 
or claim 12 when expressed by recombinant DNA or RNA according 
to claim 10. 

15. (Amended) A cell, plasmid, virus or live 
organism having incorporated expressibly therein a sequence 
according to any one of claims 1 to 10 , whereby it is capable 
of producing an apoprotein. 

16. (Amended) A vector comprising a sequence 
according to any one — of claims- 1 to — 

18. (Amended) A bioluminescent oxidative indicator 
protein (BOIP) r comprising an apophotoprotein according to any 
one — of—claims- 11 to — 3r4 in association with a luciferin. 



Anthony Keith CAMPBELL 

23. (Amended) A method according to claim 21 or 22 , 
wherein said BOIP is selected from native or chemical ly-or 
genetically-modified BOIP or a 'rainbow protein' based on such 
a BOIP. 

24. (Amended) A method according to any one — 

claims 21 to 23, wherein said BOIP includes a signal peptide, 

targeting it to a pre-determined extra-or intra-cellular site. 

25. (Amended) A method according to crrry — orre — erf 
claims 21 to 23 , comprising incubating a test sample with a 
cell — according — bo — claim — 1-5 — or — with — a — membrane — preparation 
derived — therefro m , plasmid, virus or live organism having 
incorporated expressibly therein: 

(a) a sequence that encodes the apophotoprotein of 
pholasin (alternatively, A apopholasin / ) ; 

(b) a sequence substantially homologous to or that 
hybridises to sequence (a) under stringent conditions; or 

(c) a sequence substantially homologous to or that 
hybridises under stringent conditions to the sequence (a) or 
(b) but for the degeneracy of the genetic code; or 

(d) an oligonucleotide specific for any of the 
sequences (a) , (b) or (c) PROVIDED THAT such homologous 
sequences according to (b) or (c) encode a protein capable of 
binding to lucif erin . 
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26. (Amended) A method according to my — orre — of 
claims- 21— tro — 2-4-, wherein light emission takes place in the 
absence of a luciferase. 

27. (Amended) The use of a sequence or a protein 
according to any one of claims 1-Hro — in the detection, 
diagnosis or measurement of oxygen or a metabolite thereof. 

29. (Amended) A method for obtaining a substan- 
tially homologous source of apopholasin, which method com- 
prises culturing cells having incorporated expressibly therein 
a polynucleotide encoding apopholasin as defined in any one of 
claims 1 to 10 , and thereafter recovering the cultured cells. 

30. (Amended) A method, use or kit according to -any 
one — of— claims 20— to — 2-9-, substantially as hereinbefore de- 
scribed with particular reference to the Examples. 
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I m An isolated, purified or recombinant nucleic acid sequence comprising: 

(a) a sequence th# encodes the apophotoprotein of pholasin (alternatively, 
5 'apopholasin'); 

(b) a sequence substantially homologous to or that hybridises to sequence (a) under 
stringent conditions; or 

(c) a sequence substantially homologous to or that hybridises under stringent 
conditions to the sequence (a) or (b) but for the degeneracy of the genetic code; or 

1 0 (d) an oligonucleotide specific for any of the sequences (a), (b) or (c) 

PROVIDED THAT such homologous sequences according to (b) or (c) encode a protein 
capable of binding to luciferin. 

2. A sequence according to claim 1, wherein the sequence that encodes for 
apopholasin is as shown in Figure 4B. 
15 3. A sequence according to claim 1, wherein the: sequence that encodes for 
apopholasin is as shown in any one of Figuresi, 2, 3, 4A, 6 or 9. 

4> A sequence according to any preceding claim, wherein the apopholasin is npn- 
glycosylatecL 

5. A sequence according to any preceding claim, wherein the apopholasin is 
20 glycosylated. 

6. An isolated, purified or recombinant construct incorporating a sequence encoding 
an apophotoprotein whose expression in a substrate, in association with a lucifbrin 
therefor, signals the presence of oxygen or an oxygen metabolite in the substrate. 

7* An isolated, purified or recombinant construct incorporating a sequence encoding 
25 an apophotoprotein whose expression in a substrate, in association with a luciferin 
thereibr, signals the presence of oxygen or an oxygen metabolite in the absence of a 
corresponding luciferase in the substrate. 

8. A construct according to claim 6 or claim 7, wherein the apophotoprotein is 
apopholasin. 

30 9. A recombinant construct according to any one of claims 1 to 8, wherein the 
nucleic acid sequence is linked operabiy with nucleotides enabling expression and 
secretion of the apopholasin in a cellular host. 
1 0, DNA or RNA according to any of claims 1 to 9. 
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11. An isolated, purified or recombinsntpolypeptiae comprising apophotoproi ^ u Wv ^ Ow0 • 
pholasin (apopholasin), or a mutant or variant thereof, which mutant or variant is capable 
of binding to luciferin. 

12. An isolated, purified or recombinant polypeptide according to claim 11 
5 comprising the amino acid sequence of Figure 4 or Figure 5 . 

13. - The apopholasin according to claim 11 or claim 12 when expressed by 
recombinant DNA or RNA according to claim 10* 

14. The apopholasin according to claim 13, which is non-glycosylaled. 

1 5. A cell, plasmid, virus or live organism having incoiTporated expressibly therein a 
10 sequence according to any one of claims 1 to 10, whereby it is capable of producing an 

apoprotein. 

1 6. A vector comprising a sequence according to any one of claims 1 to 10. 

1 7. A host cell transformed or transfected with, a vector according to claim 1 6. 
IS. A biolmninescent oxidative indicator protein (BOXP), comprising an 

1 5 apophotoprotein according to any one of claims 1 1 to 14 in association with a luciferin,. 

19. A BOIP according to claim 18, wherein the luciferin is derived from Pholas 
dactyhts. 

20. A method for the preparation of a bioluminesceni oxidative indicator protein 
(BOIP), which method comprises bringing an apophotoprotein into association with a 

20 luciferin therefor. 

21. A method for the detection and/or measurement of oxygen or one of its 
metabolites extracellularly, which method comprises providing a bioluminescent 
oxidative indicator protein (BOIP) extraceliularly and thereafter detecting and/or 
quantifying light emission therefrom and/or changes in colour, intensity and/or 

25 polarisation of emission(s) T wherein the apophotoprotein comprises recombinant 
apopholasin. 

22. A method for the detection and/or measurement of oxygen or one of its 
metabolites in live cells (intraceiiularly), which method comprises providing a BOIP 
intracellularly and thereafter detecting and/or quantifying light emission therefrom and/or 

30 changes in colour* intensity and/or polarisation of emission(s) therefrom. 

23. A method according to claim 21 or 22, wherein said BOP is selected from native 
or chemically- or genetically- modified BOIP or a 'rainbow protein 1 based on such a 
BOIP. 
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24. A method according to any one of claims 21 tc 23. wherein sddBOIP hteiuuca 

a signal peptide, targeting it to a predetermined extra- or intra- cellular site. 

25. A method according to any one of claims 21 to 23, comprising incubating a test 
sample with a ceil according to claim 15 or with a membrane preparation derived 

5 therefrom, 

26. A method according to any one of claims 21 to 24, wherein light emission takes 
place in the absence of a luciferase. 

27. The use of a sequence or a protein according to any one of claims 1 to 19 in the 
detection, diagnosis or measurement of oxygen or a metabolite thereof, 

10 28. A diagnostic kit incorporating a sequence or protein according to any one of 
claims 1 to 19. 

29. A method for obtaining a substantially homologous source of apopholasin, which 
method comprises culturing cells having incorporated expressibly therein a 
polynucleotide encoding apopholasin as defined in any one of claims 1 to 10, and 

15 thereafter recovering the cultured cells, 

30. A method, use or kit according to any one of claims 20 to 29, substantially a* 
hereinbefore described with particular reference to the Examples, 
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PROTEIN AND DNA CODING THEREFOR 

JC08 Rec'd PCT/PTO 0 7 MAY 2Q0t 

The present invention relates to a protein, capable of bioluminescence, cDNA coding 
therefor and their uses, inter alia, in diagnostics and therapy. In particular, this invention 
relates to the cloning and sequencing of cDNA coding for pholasin from the bivalve 
mollusc Pholas dactylus. 

The term 'bioluminescence' refers to the emission of light resulting from a chemical 
reaction within, or produced- by, a living organism. The essential components to the 
chemical reaction are: an organic molecule, usually comprising a luciferin; oxygen or one 
of its metabolites; and an enzyme or luciferase that catalyses the oxidation of the luciferin. 
The chemiluminescent reaction responsible for bioluminescence may be represented as 
follows: 

Luciferin + 0 2 , or 0" 2 , or H 2 0 2 , or OH or OC1" or OX" or l 0 2 (+ luciferase) ► 

oxyluciferin + light. 

Up to three other substances may also be required to generate light or to generate light of 
the required colour and intensity. These are as follows: 

(a) A cation, such as IT, Ca 2+ , Mg 2 ' or a transition metal cation (eg Cu7Cu 2+ , 
Fe 2 7Fe 3+ , La 3+ and V 3+ ); 

(b) A co-factor such as NAD(P)H, FMN or ATP; and/or 

(c) A fluor as an energy transfer acceptor. 

Five chemical families of luciferin are known: 

(a) Aldehydes (found in the freshwater limpet Latia, earthworms, and with FMN in 
bacteria); 

(b) Imidazolopyrazines, which are the compounds most commonly responsible for 
bioluminescence in the sea (found in Sarcomastigophora, Cnidaria, Ctenophora, 
Annelida, Chaetognatha, some Arthropoda, some Mollusca and some Chordata); 

(c) Benzothiazoles (found in beetles such as fireflies and glow-worms); - 
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(d) Linear tetrapyrroles (found in dinoflagellates, euphausiid shrimp and some fish); 
and 

(e) Flavins (found in bacteria, fungi, polychaete worms and some molluscs). 

Chemiluminescent reactions involving these luciferins may produce a glow or a flash with 
an emission of violet, blue, blue-green, green, yellow, orange or red light, or occasionally 
UV or IR light. The light emission may be linearly or circularly polarised. The luciferin 
or its product may also be detected and quantified by fluorescence or phosphorescence. 
As a chemical reaction is directly responsible for the light emission, it does not require 
exposure to UV, visible or IR light. However, some bioluminescent systems, such as that 
in the red organ of the deep sea fish Malacosteus, exhibit a photo-chemiluminescence, 
where light can trigger or enhance the chemiluminescent reaction. [Reference is directed 
to Chemiluminescence: Principles and Applications in Biology and Medicine, A K 
Campbell (1988), Horwood/VCH Chichester, Weinheim.] 

In the case of some bioluminescent proteins, the luciferin is so tightly or covalently bound 
to the protein molecule that it does not diffuse away into the surrounding fluid as a result 
of the chemiluminescent reaction. In this case, the protein-luciferin complex is known as 
a photoprotein; and the protein itself is referred to as an apophotoprotein. Some 
bioluminescent proteins are proteins whose light emission or radiation depends on or may 
be altered by oxygen or one of its metabolites; these bioluminescent proteins are 
hereinafter referred to as 'bioluminescent oxidative indicator proteins' (BOIPs).BOIPS 
may thus be photoproteins or luciferin-luciferase systems. 

BOIPs, therefore, may be used to detect and quantify oxygen or one of its metabolites in 
individual cells, defined compartments of living cells such as the nucleus, whole organs 
and organisms - both animals and plants, including microbes such as viruses and bacteria 
and protozoa - as well as substances of biological interest such as substrates, metabolites, 
vitamins, drugs, intra- and extra-cellular signals,^ enzymes, antigens, antibodies and 
nucleic acids. Heretofore, it has only been known to employ native BOEPs 
extracellularly. 

The present invention therefore relates to a method for the detection and/or measurement^ 



3 

of oxygen or one of its metabolites in live cells (intracellular), which method comprises 
providing a BOIP, such as native or chemically- (or genetically-) modified BOIP or a 
'rainbow protein' based on such a BOIP, intracellular^ and thereafter detecting and/or 
quantifying light emission therefrom and/or changes in colour, intensity and/or 
polarisation of emission(s) therefrom. 

Furthermore, it has now been found that, by sequencing the BOIP and identifying the 
cDNA encoding therefor, the recombinant BOIP can also be used in such a method, or 
chemically- or genetically-modified recombinant BOIP, or a 'rainbow protein' based on 
such a BOIP. For example, the bivalve mollusc Pholas dactylus has been shown to 
comprise a native photoprotein, which interacts with a luciferase, when they are secreted 
together by the mollusc to produce light when 0 2 or one of its metabolites is present. 
References to the Purification and Properties of Pholas Dactylus Luciferin and Luciferase 
can be found by Michelson in Methods in Enzymology LVII 385-406 (1978). References 
to detection of activation of neutrophils by detection of superoxide anion can be found by 
Roberts in Anal Biochem 160 139-148 (1987) and by Muller et al in J Biolum Chemilum 
3 105-113 (1989). The native photoprotein (known as pholasin) is made up of a 
glycosylated apoprotein (34kDa) with a small organic molecule, the luciferin, tightly 
bound to it. This luciferin (whose structure is unknown - Muller and Campbell in J 
Biolum Chemilum 5 25-30 (1990)) can be extracted from the protein moiety - the 
apopholasin - or from the organism by a standard treatment, such as mild acid. The 
pholasin may be collected from live molluscs found in sedimentary rocks at low water 
along the south coast of England from Plymouth to Folkestone and also along the French 
channel coast and in the Mediterranean. Further details may be obtained from marine 
fauna and the references cited herein. 

We have surprisingly found that pholasin can generate light even without the presence of 
the corresponding luciferase by addition of oxygen metabolites such as 0 2 H 2 0 2 , OCl- 
or other oxyhalide anions, or organic peroxides, and certain organic solvents such as 
dimethyl sulphide (DMSO) or dimethyl formamide (DMF). 

We have now identified the cDNA encoding for the (non-glycosylated) apoprotein of 
pholasin, which may also be called 'apopholasin'. Accordingly, the present invention^ 



therefore further provides an isolated, purified or recombinant nucleic acid sequence 
comprising: 

(a) The apophotoprotein of pholasin (alternatively, 'apopholasin'); 

(b) A sequence substantially homologous to or that hybridises to sequence (a) under 
stringent conditions; or 

(c) A sequence substantially homologous to or that hybridises under stringent 
conditions to the sequence (a) or (b) but for the degeneracy of the genetic code; or 

(d) An oligonucleotide specific for any of the sequences (a), (b) or (c). 

The present invention will now be further described with reference to the accompanying 
Figures, in which: 

Figure 1 shows three different cDNAs encoding apopholasin, referred to as clones 40, 3 
and 5. Nucleotides in bold type show codons used for initiation and termination of 
translation; 

Figure 2 shows the three sequences of Figure 1 aligned to demonstrate the sequence 
similarity. This figure was generated by Clustal. Positions which are indicated with a 
star are identical in all three clones. The codons for the initiation and termination of 
translation are highlighted in bold; 

Figure 3 shows the oligonucleotides used for the complete sequencing of the positive 
clones. These were identified from the cDNA library; their positions in clone 40 are 
shown. Oligonucleotides are shown in bold type, portions of the flanking sequence of the 
Bluescript plasmid are shown in italic type; 

Figure 4 describes the protein sequence described by the DNA sequence coding for 
apopholasin and shows, in Figure 4A, the complete sequence of the positive clone 40 
identified from the Pholas dactylus light organ library. The first 20 amino acids at the Mr- 
terminus are a signal peptide, and this can either be retained or removed when generating 
the BIOP as described in this invention and, in Figure 4B, the cDNA coding for 
apopholasin with untranslated 5' and 3' ends. The untranslated regions are also shown; ^ 



Figure 5 describes the protein sequence for pholasin with (5B) and without (5A) the 
signal peptide; 

Figure 6 shows the sequence for apopholasin genomic DNA. Two gDNA clones were 

indentifed but no introns were found; the Figure shows an alignment of the cDNA from 

cDNA clone 40 and the gDNA amplified by both rTth DNA polymerase XL and BioXAct 

polymerase. The sequences of the PCR product and the inserts in pGEM T were aligned 

with the sequence of the cDNA of clone 40 and were identical to this cDNA; 

Figure 7 describes the oligonucleotides used for screening and expression. Degenerate 

oligonucleotides for library screening are shown in Figure 7A; non-degenerate ones in 

Figure 7B; and oligonucleotides used for protein expression are shown in figure 7C; 

Figure 8 lists the main restriction sites in the DNA for engineering pholasin; 

Figure 9 is a schematic representation of Figure 8 mapped to the sequence of Figure 4A 

(translated region), and 

Figure 10 is a time course of hypo-chlorite triggered chemiluminescence. 

Accordingly, the present invention provides recombinant DNA encoding the 
apophotoprotein apopholasin and comprising the nucleotide sequence of the sequence 
disclosed in Figure 4B. Three different cDNAs coding for apopholasin have been 
isolated, having differing non-coding regions, respectively disclosed in Figure 1. The 
genomic DNA (gDNA), which contains no introns, has been shown (Figure 6) to 
comprise the same basic sequence as the cDNA. 

Pholasin is a glycoprotein having 11.1 glusamine, 9.8 fructose, 7.1 mannose and 5.2 
galactose residues. The cDNA for apopholasin has a molecular weight of 23,456 
compared to 34,600 of the pholasin extracted from Pholas. The difference in the 
molecular weights of native versus recombinant apopholasin is due to the glycosylation of 
the native protein and luciferin. The isoelectric point of the translated protein calculated 
by the ISOELECTRIC command of the GCG programme is at 3.84. The native protein 
has a lower isoelectric point (<3.5), probably due to the presence of bound sulphate. 
The three clones (Figure 2) isolated from the library encode a unique protein (Figures 4 
and 5), which does not have the same amino acid sequence as any known protein in the 
SwissProt database. The present invention therefore not only provides cDNA and RNA^. 
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coding for the protein, but also the recombinant protein per se, with or without 
glycosylation units. A comparison of segments of the pholasin protein sequence with the 
proteins in the SwissProt database identified several proteins with regions having a high 
sequence similarity to regions of the cloned protein. These included several proteins 
which interact with nucleotides (Table 1). 



Table 1 A comparison of sections of the sequence of the cloned protein with sections 

of proteins which interact with nucleotides. / 



Protein 


Homologous region 
cloned protein 

homology (+ denotes a conserved amino acid) 
selected protein 


ckjn J\- splicing enaonuc lease p 
subunit 

Saccharomyces cerevisiae 
EC 3.1.27.9 


OJjI U Hi U 1M LN \-3 V L 1 LJ S—i'^J IX v It Olj J. X J- 1 

+L DEDNN + + G ++P E++E 
NLRDEDNNLLDENGDLLPLESLE 

LDQDVELDYTW 
LD DV DYTW 
LDHDVSKDYTW 


ATP -AMP transphosphorylase 
Cyprinus carpio 
EC 2.7.4.3 


VMDEGKVIPSETIEDDIKDCGLLDQDVELDY 
+M +G+++P +T+ D IKD + DV Y 
IMQKGELVPLDTVLDMIKDAMIAKADVSKGY 


DNA primase 

Synechocystis sp. 
EC 2.7.7. - 


EEVQCAMNWTQANEYVFNVD 
++VQ M ++Q+ + +FN D 
DQVQSLMRFS QS KQ IIFNFD 


purine permease 
Emericella nidulans 


VQCAMN WT Q AN EYV 
+ N ++■ 
IMCSVDWTRRNRFI 


DNA repair protein complementing 
XP-A cells homologue 
Drosophila melanogaster 


PDTVDEAEDTPSET 
PDT DE EDT + T 
PDTYDEEEDTYTHT 


ATP synthase {3 chain 
Peptococcus niger 
EC 3.6.1.34 


DTVDEAEDTPSET 
D +DEA + PSET 
DPIDEAGEVPSET 


DNA polymerase a 
Homo sapiens 
EC 2.7.7.7 


DEDNNGVMDEGKVIPSETIEDDIKD 
D+D G +++G+ I + +EDD D 
DDDGIGYVEDGREIFDDDLEDDALD 



Similarity was found between the Vargula luciferase and Renilla LBP ? but no other 
bioluminescent protein. 



Sequence homology between the cloned protein and (a) Vargula luciferase (b) Renilla 
LBP. An area of high homology in all three proteins is in bold print. 



(a) 148 _ 206 
GTIWTVRVSLYDEDNNGVMDEGKVIPSETIEDDIKDCGLLD-QDVELDYTWTQNECDL 

V+VSL D + 4- 4- T4-DID 4- V4-4- 4- + 

YWNTWDVKVSLRDVESYTEVEKVTIRKQSTWDLIVDGKQVKVGGVDVSIPYSSENTSI 
353 412 

(b) : 

105 166 
STMPGTYMLMDVCATRDADDKCIEGTIWTVRVSLYDEDNNGVMDEGPCVIPSETIEDDIKDC 

+ TR + VR+S+ + N+ K I 

AIKIAKLSAEKAEETRGFLRVADQLGLAPGVRISVEEAAVNATDSLLKMKAEEKAMAVIQSL 
41 104 



Three potential glycosylation sites on the protein have the consensus triplet sequence Asn- 
Xaa-Ser/Thr (where Xaa can be any residue except proline). Thr 216 was identified as a 
potential site of O-linked glycosylation by a neural network which has been trained to 
identify this type of glycosylation. The amino acid sequence was also entered into a neural 
network which had been trained to identify eukaryotic signal peptides. This confirmed that 
the most likely cleavage site is between positions 20 and 21 (GSG-EE). 

Many families of proteins contain a "signature" sequence of amino acids. The sequence of 
the clones did not contain any of these signatures present in the PROSITE database. " The 
amino acids from 170 to 185 correspond to the calcium binding consensus sequence 
[DENQST]XPENQST]X[DENQST]XPENQST]XPENQST]XX[DENQST]. Thirteen 
potential phosphorylation sites were discovered that matched the consensus sequences for 
either the kinase phosphorylation site [RK](2)-x-[ST] x the protein kinase C phosphorylation 
site [ST]-x[RK] or the casein kinase II phosphorylation site [ST]-x(2)-[DE]. 

Three N-linked glycosylation sites were identified in the translated sequence of the clones A 
neural network has been trained to identify this type of glycosylation which identified Th^ 
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216 as a potential site of O-linked glycosylation. At least one of these sites must be 
glycosylated in the native protein in order to account for the presence of the sugar residues. 
A putative signal peptide region preceded the N terminus of the secreted protein 
(determined by amino acid sequencing and was identified as a signal peptide by a neural 
network). To confirm this result the protein sequence was searched with PSORT for motifs 
which would locate the cloned protein in a cellular compartment. The protein sequence did 
not contain any transmembrane regions or N-myristoylation patterns which would indicate 
the presence of a lipid anchor. No targeting or retention sequences were found for the 
nucleus, mitochondria, endoplasmic reticulum or peroxisome. 

The fact that the clones had some sequence similarity with proteins that interact with 
nucleotides may suggest that pholasin binds a co-factor as part of the chemiluminescent 
reaction. Beetle luciferases require ATP binding for chemiluminescent activity. There is no 
P-loop binding motif ((A,G)x4GK(S,T) or (A)x {4} GK(T)) in the amino acid sequence of 
these clones. However, not all ATP binding proteins contain this motif. Neither does the 
cloned protein contain the GXGXXG phosphate binding consensus sequence necessary for 
the binding of other co-factors such as nicotinamide adenine dinucleotide. 

The amino acid and sugar components of pholasin are not able to emit light at the 
wavelength of the native protein (490nm). This indicates that there must be a chromophore 
bound to the protein. There are, however, proteins in which the chromophore is composed 
of modified amino acid residues within the polypeptide. The best characterised of these is 
the green fluorescent protein (GFP). This has a chromophore which is a ring formed by the 
autocatalytic cyclisation of the residues Ser-dehydroTyr-Gly. The serine may be mutated to 
a threonine, which increases the amplitude of the emission at 488nm. Pholasin had no 
similar amino acid sequence. Putative luciferin binding regions have been identified for two 
bioluminescent chemistries. Aequorin has a putative coelenterazine binding region, which 
is also present in two sections of the Vargula hilgendorfii luciferase. The sequence of the 
cloned protein has no homology to the putative luciferin binding site of aequorin, but the 
region of the Vargula luciferase from residue 353 to 411 has some similarity, as does the 
LBP of Renilla reniformis, which also binds an imidazolopyrazine. This may indicate that 
the chemistry of pholasin bioluminescence involves an imidazolopyrazine luciferin. 
However, the region of homology is very small. The beetle luciferases contain an area ofs 



low sequence homology which may bind the benzothiazole luciferin. This low homology 
may account for the different colours of beetle bioluminescence. used a luciferin analogue 
(2-(4-benzoylphenyl) thiazole-4-carboxylic acid which photoinactivated the luciferase active 
site of the firefly Photinus pyralis. This photoinactivation was directly linked to the 
degradation of a small peptide sequence HHGF (residues 244-257). This is therefore 
postulated as a luciferin-binding site. The cloned protein does not have any sequence 
homology with these putative binding regions. Two strongly conserved regions of amino 
acids have also been found in both the luciferase and the luciferin binding protein of the 
dinoflagellate Gonyaulax polyedra. These regions were compared to the cloned protein, but 
no sequence similarity was found. No sequence identity could be established between the 
bacterial luciferases and the cloned protein. 

Therefore, the present invention provides cloned apophotoprotein apopholasin (and the 
cDNA coding therefor), which has identical properties to native (but non-glycosylated) 
apopholasin with respect to molecular weight, amino acid composition, potential for 
glycosylation, its highly acidic pi and its cellular location. Hence, the present invention 
can further provide the corresponding BOIP or modified BOIP, according to standard 
methods. 

The corresponding BOIP is preparable by bringing the apophotoprotein pholasin into 
association with the luciferin, also using standard methods. Although the luciferin is 
tightly bound in the native pholasin BOIP, it has been found that it may not be the case in 
the recombinant pholasin BOIP; indeed the luciferin may be weakly bound or merely 
present with the apoprotein. For example, a methanol, aqueous, acidic or other extract of 
Pholas dactylus (whole organism or light organ dissected from the animal) containing the 
'luciferin', or the pure luciferin, may be added to the solution, cell or organism (Figure 10 
shows the time course of hypo-chlorite triggered luminescence in these circumstances). A 
time course of apopholasin reactivation was performed by incubating partially purified 
recombinant pholasin secreted by insect cells with acid:menthanol extract of Pholas 

dactylus (•) for 0 (solid line), 1 (— ), 2 ( ), 6( ) or 24 hours ( ), or incubated 

without extract (o). Controls of buffers only with no protein or acidimethanol extract ( ) 

and extract alone (■) were treated in identical conditions. Chemiluminescence was^ 



10 

triggered by the addtion of 2% soidum hypochlorite at 10 seconds (arrow) and is shown 
as chemiluminescent counts minus background light. A typical curve obtained by 

hypochlorite triggering of native pholasin is also shown (X). A representative experiment 
carried out in duplicate is shown. 

The luciferin associates with the apo-BOIP forming the photoprotein or remains loosely 
bound so that it turns over like a luciferase.. The luciferin on the photoprotein then reacts 
with oxygen or one of its metabolites to produce light, in the presence or absence of the 
luciferase. The light emission may be detected, quantified, or imaged using a 
luminometer, photographic film or imaging camera, or by the naked eye. Alternatively, 
light emission may be generated spontaneously by intra- or extra-cellular metabolites 
reacting with the apo-BOIP. 

Although illustrated with respect to pholasin, the following may apply to any BOIP: the 
BOIP can be produced directly from native DNA, or from DNA engineered or amplified 
by the polymerase chain reaction. By this means, sites can be inserted within the protein 
by splitting the DNA into two or more pieces, or by adding DNA sequences to the 5 ; or 3' 
ends. For example, the DNA may be expressed in bacteria, yeast, an insect or human cell, 
or other suitable organism to produce protein which can be extracted and used. 

In this instance, the protein produced from the cloned DNA reacts with oxygen or a 
metabolite of oxygen, such as the superoxide anion (0 2 ~), hydrogen peroxide (H 2 0 2 ), 
hydroxy radical (OH ), an oxyhalide anion (OCT, OBr", 01", OSCN"), nitric oxide (NO), an 
organic hydroperoxide or a radical ROO . The change in light emission enables the 
oxygen or metabolite(s) to be detected and quantified in live cells, organelles, or on the 
outer or inner surface of the plasma membrane, or within an organ of a live organism 
without the need to break them open or the need to separate bound and free fractions. 
This also enables an enzyme producing oxygen or one of its metabolites, such as 
chlorophyll, or enzymes such as oxidases and oxygenases which react directly with 
oxygen or one of its metabolites to attach oxygen to the substrate to be detected and 
quantified in live cells, organs and whole organisms, or extracts from any one of these. 

Also the BOIP can be made in vitro by transcription/translation in a cell lysate such as/ 
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rabbit reticulocyte lysate or wheat germ extract containing RNA polymerase. The DNA 
for the BOIP is first engineered to contain an RNA polymerase promoter, such as T7, 
SP6; bacterial promoter(s), such as lac, ara or trp; or mammalian promoter(s), such as 
actin, myosin, myelin proteins, TK, MRT-V, SV40, CMV, RSV, metallothionine, 
antibody, G6P dehydrogenase, and can be amplified in vitro using the polymerase chain 
reaction. A poly-A tail may be added at the 3' end and a tissue specific promoter or 
enhancer sequence added to the 5' or 3' end of the DNA coding for the BOIP or modified 
BOIP, enabling it to be expressed specifically in a target cell, such as a myocardial cell or 
a cancer cell. The expression of the BOIP in the target cell is detected and quantified by 
light intensity, colour or polarisation, as previously mentioned. 

The BOIP, or its DNA or RNA, may be incorporated into a live bacteria or eukaryotic cell 
using phage, virus, plasmid, calcium phosphate transfection, electroporation, liposome 
fusion, membrane pore forming proteins, micro-injection or DNA gun. Once inside cells 
or an appropriate extracellular environment, cell activation or injury will initiate or 
change the light emission from the BOIP. For example, expression in live organisms by 
micro-injection of protein, RNA or DNA, or by transgenic manipulation to produce a cell, 
such as a bacterial, microbial, animal or plant cell, eg a white blood cell, a heart cell, or a 
yeast, protozoan, fruit fly (Drosophila), nematode worm, polychaete worm, fish, human, 
mouse, rat, sheep, pig, horse or plant, which can generate its own light. 

The BOIP can also be incorporated into a defined part of a live cell by chemical means or 
by genetically engineering the BOIP to contain a signal peptide, which locates the BOIP 
to the inner or outer surface of the plasma membrane or within a particular organelle, such 
as peroxisome, mitochondrion, chloroplast, tonoplast, endoplasmic, reticulum, Golgi 
apparatus, endosome, lysosome, secretory vesicle, nucleus, nucleolus, nuclear membrane, 
plasma membrane, proteosome, or gap junction, or structure such as membrane receptor 
ion channel microtubule, cytoskeleton, nuclear skeleton, nuclear receptor, mitotic spindle 
or microfilaments. The signal peptide, added either chemically or genetically, will 
normally target the normal or modified BOIP to a particular intra- or extra-cellular site. 
For example, the sequence MLSRLSLRLLSRYLL or part of cytochrome oxidase on the 
N-terminus will target the BOIP to the mitochondrion; KKS ALL ALM Y V CP GKADKE> 
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or MLLPVPLLLGLLGLAA at the N-terminus will target the BOIP to the endoplasmic 
reticulum, a KDEL or HDEL or KEEL sequence at the C-terminus retaining it there. SKL 
at C-terminus targets BOIP to the peroxisome; PKKKRKV or an extension of this SV40 
large T-antigen signal will target it to the nucleus; and a palmitoylation and/or a 
myristoylation signal will target it to the plasma membrane. By coupling the BOIP to 
another protein that targets itself to a particular site, the BOIP can also be targeted there. 
For example, coupling the nuclear proteins nucleoplasmin or lamin B receptor to BOIP 
targets it to the nucleus; cytochrome oxidase at the N-terminus targets BOIP to the 
mitochondria; chlorophyll at the N-terminus targets BOIP to the chloroplast; a connexin 
at the N-terminus targets BOIP to the gap junction or plasma membrane; and SNAP 25 to 
the plasma membrane. 

Other modifications to the apoprotein, BOIP, or nucleotides coding therefor include, but 
are not limited to: 

The apoprotein, such as apopholasin, may also be glycosylated, and used to detect and 
quantify secretion or movement of proteins through the secretory pathway. 

Nucleic acid coding for the BOIP when expressed inside a live cell may not only be 
modified but also regulated in this cell by gene expression, such as by promoters, 
enhancers or oncogenes. For example, the apoprotein, such as apopholasin, may be 
coupled to a gene regulator protein, such as a transcription factor, by genetic or chemical 
manipulation, such that the movement through a cell or of the regulator protein or its 
activity, can be detected or quantified. 

The BOIP or apoprotein, or its DNA may be linked to another protein or DNA used in 
therapy, such that the other protein or DNA can be detected in live cells or in a whole 
organism, eg a human. 

The apoprotein, such as apopholasin, can also engineered genetically or chemically to 
contain a site or sites which can be covalently modified by enzymes such as 
phosphorylation (including ser/thr, his and' tyr kinases and phosphatases), 
tranglutamination, proteolysis, ADP ribosylation, gly-or glu-cosylation, halogenation^ 
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oxidation, methylation, palmitoylation, myristylation and farnesylation. 

The apoprotein, such as apopholasin, can be engineered genetically or chemically to 
contain an antigen or intracellular signal binding site, such as Ca 2+ , cyclic AMP, cyclic 
5 GMP, cyclic CMP, IP 3 , IP 4 , diacyl glycerol, ATP, ADP, AMP, GTP, or any oxy- or 
deoxy-ribonucleoside or nucleotide, a substrate, a drug, a nucleic and/or a gene regulator 
protein. 

The BOIP may also be converted to a rainbow protein by engineering a particular site 
10 such as described hereinabove into the BOIP, at the N- or C-terminus, or between a 
chimera of the BOIP and an energy transfer acceptor, such as GFP (wild type or any of 
the mutant GFPs). This is known as chemiluminescence, bioluminescence or 
fluorescence resonance transfer (CRET, BRET or FRET, respectively). Conversion of the 
BOIP to a 'rainbow protein' may be effected by reaction with a cellular substance, 
15 modification genetically or chemically, or by linking the BOIP to a fluor, such as the 
green fluorescent protein or the red fluorescent protein in the deep sea fish Malacosteus. 
The result is a BOIP which changes its colour and/or intensity and/or polarisation of 
emission. The change in colour occurs by energy transfer, eg resonance transfer (CRET 
or FRET) or electron transfer. 



r: 20 
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The initial (unmodified) BOIP may be the apophotoprotein, its DNA or RNA, from the 
bivalve mollusc Pholas dactylus, or it may be another BOEP, such as one from the 
mollusc Rocellaria grandis or the squid Ommastraphes, or earthworm luciferase, which 
produce light with oxygen metabolites in a way very similar to Pholas dactylus. 

The BOIP, apo-BOIP, or nucleic acid coding for it, whether modified or not, may 
therefore be used in a range of biology and investigations such as: 

(a) Detection, location and measurement of signals in substrates, such as live cells, 
30 organs or organisms, or in extracellular fluids; 

(b) Detection, location and measurement of oxygen and its metabolites in substrates, 
such as live cells, organs or organisms, or in extracellular fluids, water (sea and 
fresh), soil or the atmosphere; 



Ill 



14 

(c) Detection and location of normal cells such as microbes (protozoa, yeast, fungi, 
moulds, bacteria, viruses); 

(d) Detection and location of abnormal cells, such as cancer cells, hyperactive cells in 
rheumatoid arthritis and other inflammatory diseases, cells infected with a 

5 pathogen, such as a virus or other infectious agents, cells damaged by physical, 

chemical or biological attack, cells damaged by perfusion or reperfusion injury or 
cells damaged by oxygen or one of its metabolites; 

(e) Measurement and location of enzymes, particularly those producing oxygen or its 
metabolites, and other tumour reactions in cells or biological fluids; 

1 0 (f) DNA and RNA binding assays; 

(g) Immunoassay and other protein binding" assays; 

(h) In genetic engineering, in the development of transgenic animals and plants, and 
microbes; in horticulture; agriculture; medicine and veterinary medicine; and/or 

(i) in genetic entertainment by incorporation into light sticks, greeting cards or toys to 
15 produce light of various colour, intensities, oscillations, flashes and glows; or in 

comestibles, such as food, drinks, including beers, wines ? spirits, colas and other 
soft drinks. 



20 Accordingly, the present invention further provides an apoprotein, such as pholasin 
apoprotein (or apopholasin) in both unglycosylated and glycosylated forms, and a BOIP 
thereof, such as pholasin, either alone (but excluding native proteins per se that have 
already been isolated, such as native pholasin per se) or in association with one or more 
of: a targeting or signal peptide; a glycosylate; a site capable of modification by an 

25 enzyme; an antigen or intracellular signal binding site; a promoter, an enhancer or an 
oncogene or a pharmacologically active substance; or the like. The present invention 
further provides a recombinant construct comprising a nucleic acid sequence encoding for 
any of these proteins; a vector containing a nucleic acid sequence encoding for any of 
these proteins; a host transformed by such vector; ^a live cell, such as bacterial, insect, 

30 eukaryotic, prokaryotic, archae or plant cells containing or expressing any of these 
proteins; and a rainbow protein, as described herein, together with a nucleic acid 
sequence encoding therefor. 



The present invention will now be illustrated with reference to the following non-limiting 
examples, in which the methodology referred to is known to those skilled in the art and/or 
may be carried out by analogy with reference to the protocols disclosed in the following 
references, the contents of which are herein incorporated by reference in their entirety: 
5 BOOKS 

1. Campbell,AK. (1988). Chemiluminescence: principles and applications in biology 
and medicine, pp608. Horwood/VCH, Chichester and Weinheim. 

2. Campbell,AK. (1994). Rubicon: the fifth dimension of biology, pp 304. Duckworth, 
London. 

10 PAPERS 

1. Campbell,AK Patel,A. (1983) Biochem J. 216:185-194. A homogeneous 
i^k immunoassay for cyclic nucleotides based on chemiluminescence energy transfer. 

m 2. Roberts,PA KnightJ Campbell,AK. (1987) Anal. Biochem. 160:139-148. Pholasin - a 

y bioluminescent indicator for detecting activation of single neutrophils. 

U 15 3. Mueller,T DaviesJEV CampbeIl,AK. (1989) J. Biolum. Chemilum. 3:105-113. 

*p Pholasin- Chemiluminescence Detects Mostly Superoxide Anion Released from 

W Activated Human Neutrophils. 

5 4. MuellerJ Campbell,AK. (1989) J. Biolum. Chemilum. 5:25-30. The Chromophore of 

%1 - Pholasin: A Highly Luinescent Protein. 

J-{ 20 5. Sala-Newby,G and Campbell,AK (1991) Biochem.J. 279:727-732. Engineering a 

2 bioluminescent indicator for cyclic AMP dependent protein kinase . 

6. Campbell, AK, Trewavas, AJ and Knight, MR (1996). Calcium imaging shows 
differential sensitivity to cooling and communication in luminous transgenic plants. 
Cell Calcium 19: 211-218. 

25 7. Kendallj JM, Sala-Newby, G Badminton, M, Campbell AK and Rembold, CR (1996). 

Free Ca 2+ in the endoplasmic reticulum of living cells measured using aequorin as a 
pseudo luciferase. Biochem.J 318:383-387 

8. Badminton, MN, Campbell, AK and Rembold, CR (1996). J.Biol. Chem. 271:31210- 
31214. Differential regulation of nuclear and cytosolic Ca 2+ in HeLa cells. 

30 9. Sala-Newby, GB, Taylor, KT, Badminton, MN, Rembold, CR and Campbell, AK 
(1998). Imaging bioluminescent indicators shows Ca 2+ and ATP permeability 
thresholds in, live cells attacked by complement. Immunology. 93:4:601-609 

lO.Sala-Newby, GB, Kendall, JM, Jones, H, Taylor, KM, Badminton, MN, Llewellyn, 
DH and Campbell, AK (1999). Targeting bioluminescent proteins to defined 
35 compartments fo living cells. Methods in Enzymology Biotuminescence and 
Chemiluminescence. ed Ziegler, M and Baldwin, T. 
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EXAMPLE 1: Production of a BOIP in bacteria 

c or genomic DNA coding for apopholasin, with or without the cDNA coding for the signal 
peptide, is amplified by PCR with restriction sites such as BamHl at each end. The cDNA 
is run on an agarose gel and the full length DNA eluted and purified. The DNA is then cut 
with BamHl to generate sticky ends and ligated into an expression plasmid such as pET3a, 
which has been cut with BamHl also. After ligation the sealed piasmid is transformed into a 
standard E.coli K12 strain such as JM109, a colony picked off for a large plasmid 
preparation. After checking that the plasmid contains the correct sequence for apopholasin 
and is in the correct orientation the plasmid is then used to transform a standard expression 
strain of E.coli such as BL21(DE3) or other expression strain. A colony is picked off the 
agar plate and grown up for 2h in standard LB broth. IPTG is added as inducer for a further 
2h. Apopholasin can then be extracted by breaking the bacteria by lysozyme digestion or 
sonication in a standard salt medium such as 50mM HEPES pH 7 +/- ImM ascorbate. Since 
the apopholasin is unglycosylated it tends to aggregate and form inclusion bodies. These 
can be broken using 8M urea or guanidinium chloride and these then dialysed off. If the pH 
of PAGE gels is alkaline this also tends to allow aggregation of both the unglycosylated and 
glycosylated apo- and full pholasin. A signal peptide such p-lactamase signal will target the 
BOIP to the periplasmic space, resulting in the ability to secrete the expressed protein from 
the external fluid of the cells. 

EXAMPLE 2: Production of a BOIP in insect cells 

c or genomic DNA coding for apopholasin is inserted into a plasmid suitable for 
conversion into baculovirus when transfected into insect cells. Since pholasin is secreted 
by Pholas itself there is a signal peptide at the N-terminus. Removal of this by PCR will 
allow cytosolic expression in insect cells. If the signal peptide is left on or changed for 
honey bee mellitin signal peptide, the apopholasin is secreted into the external medium. 
The virus containing the DNA for apopholasin is then purified and stored until required. 
An aliquot is then added to fresh insect cells and these incubated for 3-7 days. The 
apopholasin is then isolated from the supernatant if a signal peptide is used, or from the 
cells is not. The apopholasin can then be purified by ammonium sulphate precipitation, 
gel filtration and DEAE chromatography. The state of glycosylation can be assessed by 
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running the protein on PAGE when the molecular weight is 34Kda. Removal of the 
glycosylation by enzymes returns the protein to the size of apopholasin 23.5Kda. It can 
be stored frozen or freeze dried, and activated to form pholasin by addition of luciferin as 
described in Example 3. 

Since the apopholasin tends to aggregate in the insect supernatant it is important to get the 
protein into non-aggregating buffer, e.g. 50mM HEPES pH 6, 1-1 OmM ascorbate, as soon 
as possible. 

Formation of pholasin can then be achieved as described in Example 3. 
EXAMPLE 3: Generating pholasin and light emission 

To generate light the apopholasin must first be converted into pholasin with the luciferin. 
The luciferin can be extracted from native pholasin by mild acid, or by methanol, mild 
acid or alkaline treatment of light organs isolated from Pholas dactylus or the whole 
organism. After homogenisation the extract is centrifuged or filtered to remove 
particulate material. Further purification can be achieved by tic of hplc. The luciferin is 
best stored dry, but can be stored at -70°C. The intactness and concentration can be 
estimated by measuring the absorbance or fluorescence. The details are as follows: 

(a) Isolation of the luciferin 

Four protocols (1-4) have been developed to extract and isolate the luciferin 
responsible for light emission in pholasin. The luciferin is a small organic moiety 
tightly bound to apopholasin when pholasin is isolated from Pholas dactylus, but 
also can be found not bound to apopholasin. Thus the extraction procedure isolates 
either form of luciferin. 

1. The organism Pholas dactylus or its light organs are homogenised in 
50mM sodium phosphate pH 6.0 on ice. The pholasin is precipitated with 
saturated ammonium sulphate (4°C stirred), and then removed by 
centrifugation at ca 15,000g for 30min in the cold. The supernatant is then 
passed down a SEP-PAK silica column, which binds the luciferin. The 
column is washed with 5ml ethyl acetate and then 5ml of methanol. The 
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active fractions containing the luciferin are assayed either by reactivation 
of the apopholasin or by chemiluminescence in DMSO, DMF, or NaOCL 
The luciferin is concentrated and can be purified further on tic or hplc with 
a standard solvent. It is dried and stored at -70°C. 

The organism Pholas dactylus or its light organs are homogenised in cold 
acetone on ice, filtered through a Buchner funnel, and extracted with 
methanohacetone (1:1), the residual powder being extracted 3 times with 
methanol and extracts combined. These are then concentrated in a 
Rotavaporator and left to stand for lh on ice to allow further precipitation. 
The suspension is then refiltered and concentrated. The solution containing 
the luciferin is then passed down a SEP-PAK silica column which binds 
the luciferin. The column is washed with 5ml ethyl acetate and then 5ml of 
methanol. The active fractions containing the luciferin are assayed either 
by reactivation of the apopholasin or by chemiluminescence in DMSO, 
DMF, or NaOCL The luciferin is concentrated and can be purified further 
on tic or hplc with a standard solvent. It can be dried and stored at -70°C. 
The organism Pholas dactylus or its light organs are homogenised in cold 
acetone on ice, and filtered through a Buchner funnel to give an acetone 
powder. This is then extracted with methanol:acetone (1:1), twice for 
lOmin and then 3 times with methanol. The extracts are combined and 
concentrated in a Rotavaporator. They are left to stand for lh on ice to 
allow further precipitation, refiltered and concentrated. The residual 
powder is resuspended in 50mM sodium phosphate pH 6.0, lOmM 
ascorbate, and ultrafiltered with a lOkD Amicon membrane at 4 C for 
pholasin. The solution containing the luciferin is then passed down a SEP- 
PAK silica column which binds the luciferin. The column is washed with 
5ml ethyl acetate and then 5ml of methanol. The active fractions 
containing the luciferin assayed either by reactivation of the apopholasin or 
by chemiluminescence in DMSO, JDMF, or NaOCL The luciferin is 
concentrated and can be purified further on tic or hplc with a standard 
solvent. It is dried and stored at -70°C. 

The organism Pholas dactylus or its light organs are homogenised in 
50mM HEPES buffer, with methanol and lOOmM HC1 on ice, ancV 
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incubated for 2h on ice. After centrifiigation at ca 15,000g for 30min in the 
cold, the supernatant is then passed down a SEP-PAK silica column which 
binds the luciferin. The column is washed with 5ml ethyl acetate and then 
5ml of methanol. The active fractions containing the luciferin are assayed 
either by reactivation of the apopholasin or by chemi luminescence in 
DMSO, DMF, or NaOCl. The luciferin is concentrated and can be purified 
further on tic with a standard solvent. It is dried and stored at -70°C 
Method 4 normally generates most luciferin. The luciferin is characterised by its 
absorbance and fluorescence spectrum, and by its chemiluminescence with 
DMSO, NaOCl and apopholasin. 
(b) Generation of pholasinfrom apopholasin and the luciferin 

A small sample of the luciferin (l-lOjd) is added to apopholasin in an appropriate 
buffer (50mM HEPES pH 6-7.5, +/- 0.1% gelatine, +/- l-10mM ascorbate, or 
500mM NaCl, lOmM TES, ImM EDTA, ImM mercaptoethanol pH 6-7.5). The 
mixture is incubated at room temperature for up to 24h, and the pholasin assayed 
by adding an oxygen metabolite, e.g. NaOCl, or luciferase to a sample. When 
apopholasin has been expressed in cells, the luciferin is added externally, 
microinjected into individual cells or added via liposomes to get the luciferin into 
the cell 

Light is detected and quantified in a standard luminometer, imaging camera 
(intensified or CCD), or by a silicon chip. 

EXAMPLE 4: Production of a BOIP in vitro 

c or genomic DNA coding for apopholasin, with or without the signal peptide, is 
amplified by PCR with the 5' primer containing the DNA coding for T7 RNA 
polymerase. The DNA product is purified and precipitated. After dissolving in lOmM 
tris/lmMEDTA pH7, the DNA is added to a standard in vitro transcription/translation 
system such as rabbit reticulocyte Iysate or wheat germ agglutinin and incubated at 30°C 
for 30-60min. The apopholasin can then be purified and activated to form pholasin as 
described in Example 3. 
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EXAMPLE 5: Targeting a BOIP in vitro 



The BOIP can also be incorporated into a defined part of a live cell by chemical means or 
by genetically engineering the BOIP to contain a signal peptide which locates the BOIP to 
the inner or outer surface of the plasma membrane or within a particular organelle such as 
peroxisome, mitochondrion, chloroplast, tonoplast, endoplasmic reticulum, Golgi, 
endosome, lysosome, secretory vesicle, nucleus, nucleolus, proteosome, or gap junction, 
or structure such as microtubule, cytoskeleton, nuclear skeleton, nuclear receptor, or 
mitotic spindle. The signal peptide, added either chemically or genetically, will normally 
target the normal or altered BOIP to a particular intra- or extra-cellular site for example, 
the sequence MLSRLSLRLLSRYLL or part of cytochrome oxidase on the N-terminus 
will, target the BOIP to the mitochondrion; KKSALLALMYVCPGKADKE or 
MLLPVPLLLGLLGLAA or the ER protein calreticulin at the N-terminus will target the 
BOIP to the endoplasmic reticulum, a KDEL or HDEL sequence at the C-terminus 
retaining it there. SKL at C-terminus targets BOIP to the peroxisome, PKKKRKV or an 
extension of this SV40 large T-antigen signal will target it to the nucleus, and a 
palmitoylation and/or a myristoylation signal (MGCVCSSNPD = the LCK N-terminal 
acylation motif from tyrosine kinase) will target it to the plasma membrane. By coupling 
the BOIP to another protein which targets itself to a particular site then the BOIP is also 
targeted here. For example, coupling the nuclear proteins nucleoplasmin or lamin B 
receptor to BOIP targets it to the nucleus; cytochrome oxidase at the N-terminus targets 
BOIP to the mitochondria; chlorophyll at the N-terminus targets BOIP to the chloroplast; 
and a connexin at the N-terminus targets BOIP to the gap junction or plasma membrane, 
SNAP 25 to the plasma membrane. 

In order to target pholasin to defined sites in living cells, the DNA coding for these 
targeting sequences are added by using PCR. For cytosolic apopholasin the native signal 
peptide is removed and also the BOIP can be linked to larger proteins at the N- or C- 
terminus such as firefly luciferase or aequorin to prevent it getting into the nucleus. This 
also enables ATP and oxygen metabolites, or Ca 2+ and oxygen metabolites to be measured 
simultaneously in the same cells by intensity, colour or polarisation of the different 
bioluminescent indicators. A multiple bioluminescent indicator can also be engineered by 
PCR, or by using restriction enzyme sites, from the DNA coding for 3 or more 
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bioluminescent proteins. A simple screen of the transformed bacteria enables the multiple 
rainbow protein to be isolated with 2-3 colours or more. 

The DNA is then added to an in vitro transcription/translation system as described in 
Example 4 in the presence of the organelle to be targeted (e.g. microsomes for the 
endoplasmic reticulum, which glycosylate apopholasin). 

The new DNA can also be inserted into a plasmid by standard techniques, and 
transformed into bacteria or tfansfected or injected into eukaryotic cells such as HeLa or 
COS. 

Addition of the luciferin as described in Example 3 allows formation of pholasin which 
can then be detected by light emission. Changes in oxygen metabolite production are then 
be detected by a luminometer or imaging camera when the cells are exposed to external 
oxygen metabolites, a change in oxygen concentration, addition of stimuli e.g. TNF, EGF, 
hormones or drugs, or attack by pathogens such as bacteria, viruses, complement, 
antibodies, toxins, and cells of the immune system. 

EXAMPLE 6: Engineering a covalent modification site into a BOIP 

(a) The site coding a protein kinase A (RRAS or kemptide), protein kinase C 
(MARCKS), MAP kinase, ERK, the ER - nuclear signalling kinase ERE1P or a 
phosphatase is added to the N- or C-terminus or inserted at various sites within the 
apopholasin by PCR and expressed as described in Examples 1-5. Pholasin is 
then generated by addition of the luciferin as described in Example 3. 

Addition of the catalytic subunit for protein kinase A, or activation via cyclic 
AMP inside cells, leads to phosphorylation or dephosphorylation of the modified 
pholasin and change in light emission (intensity, colour or polarisation). 

A preliminary screen is necessary to select the appropriate proteins and to discard 
any which have lost all activity. 
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(b) The site coding a protease (thrombin, enterokinase, HIV protease, caspase) is 
added to the N- or C-terminus of the apopholasin by PCR or inserted at various 
sites within the protein, and expressed as described in Examples 1-5. Pholasin is 
then generated by addition of the luciferin as described in Example 3. 

5 

EXAMPLE 7: Engineering a BOIP into a "Rainbow Protein" 

cDNA coding for apopholasin is linked to another protein by using the cDNA coding for 
that protein. For example, wild type GFP, the S65T mutant of the green fluorescent 
10 protein, YGFP, or EGFP are linked to apopholasin by PCR at the N- or C-terminus, or by 
splitting one or both proteins using multi-step PCR. In between there is a 'reactive' 
peptide with a protease site (a thrombin or enterokinase) and a binding site for IP 3 , or the 
15 amino acid sequence form IP 3 kinase (an IP 4 binding site). At the C-terminus of the 
GFP, a peptide containing 6 lysine residues may also be added via PCR. The protein is 
15 expressed and fluorescein covalently linked to these lysines by addition of fluorescein 
isothiocyanate. Addition of the luciferin forms pholasin as described in Example 3, The 
change in colour occurs by chemiluminescence resonance energy transfer. Without 
fluorescein the rainbow protein emits blue-green light (508nm), which changes to blue 
(490nm) when the reactive substance binds to the reactive peptide, or when either 
20 thrombin or enterokinase is added. When the 6 amino acid linker is used the colour starts 
as green (530nm), and will then change from green, to blue-green and then blue as the 
particular reactive sequence binds their respective analytes. Use of rhodamine instead of 
fluorescein generates a rainbow protein which changes from red to green to blue. 

25 A preliminary screen is necessary to select the appropriate rainbow proteins and to discard 
any which have lost all activity. 

The other protein linked to apopholasin may be, for example, any one of the following 
linked chemically or genetically: 

30 

1. Firefly or any benzothiazole luciferase to the N or C terminus gives two colours 
for ATP and oxygen metabolites. 
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2. Any imidazolopyrazine luciferase, including coelenterazine systems - decapod 
shrimp, fish, sqiud, Renilla, anthzoan, Chaetognate, radiolarian, or copepod and 
Vargula systems - ostracod, Porichthys and similar fish, cypridinids and Vargula. 

3. Any tetrapyrrole luciferase such as dino flagellate, euphausiid or stomiatoid fish. 

4. Bacterial luciferase and other aldehyde or flavin luciferases, including polychate 
worm. 

5. Any GFP, including wild type, S65T, enhanced GFP, blue GFP, yellow GFP, 
Renilla GFP, Pilocarpus GFP, and Pennatula GFP, any anthozan GFP, or any 
coelenterate GFP. 

6. The red fluorescent proteins from stomiatoid fish - Malactosteus, Aristostomias, 
Photostomias \ 

7. The phycobiliproteins - phycoerythrin and phycocyanobilin. 

8. The blue fluorescent lumazine protein in the bacterium Photobacterium. 

9. The yellow flavin fluorescent protein in Y Vibrio. 

10. Any lysine or argininine or other amino acid side chain where a fluor can be added 
covalently. In which the case the rainbow protein amy emiot more than two 
colours. For example, rhodamine on a pholasin-linker-GFP chimera will turn from 
red to green to blue. 

A preliminary screen may be necessary to select chimeras which have not lost all 
bioluminescent activity. 

The 'reactive' peptide may be a binding site for any analyte, protein or DNA, metabolite, 
substrate vitamin, an enzyme such a protein kinase C or phosphatase, ion channel, ion 
pump, antigen, antibody, nucleotide or nucleoside such as ATP, GTP, ADP, AMP, 
adenosine, cAMP, cCMP, cCCP or their deoxy equivalents, and inositol phosphates such 
as IP 3 or IP 4i a lipid such as diacyl glycerol, phosphatidyl inositol bisphosphate, 
phosphate, a cation such as Ca 2+ , K + or Na\ Cu 2+ or Zn 2+ , or anion such as CI" , sulphate, 
or gas such as NO, 0 2 or H 2 , or a protein binding site, such as calmodulin, kinesin, dynein, 
tubulin, or myosin. 

When pholasin is triggered by oxygen metabolites, the Pholas luciferase or peroxidase, 
energy transfer occurs from pholasin oxyluciferin through GFP to fluorescein resulting irv 
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a yellow emission. Addition of thrombin for 3h cleaves the GFP-fluorescein from the 
pholasin and the light emission returns to the blue of native pholasin. Addition of IP 3 to 
the full chimera alters the efficiency of energy transfer. As a result there is a change in 
the ratio of light emitted in the yellow to blue. This ratio is directly related and can be 
plotted against the concentration or amount of analyte. The light is detected in a dual 
wavelength luminometer or ratiometric imaging camera and the ratio of blue to green 
light measured. 

Alternatively any fluors can be used, and any binding sites with the right characteristics as 
shown in these examples will work provided a simple screen is used to select the right 
chimeras. 

EXAMPLE 8: Engineering a BOIP into a "Rainbow Protein" for two analytes 
together 

Apopholasin is linked to firefly luciferase by using cDNAs and PCR, followed by 
expression in insect cells as described in Example 2. Addition of the luciferin as 
described in Example 3 generates the pholasin. In the presence of firefly luciferin (ImM), 
ATP and oxygen metabolites, this chimera emits blue and yellow simultaneously which 
can be independently measured by using a dual wavelength luminometer or imaging 
camera. 

EXAMPLE 9: Expression of BOIPs in mammalian cells 

Apopholasin, c or genomic, in an expression plasmid with the CMV promoter, is 
transfected into HeLa cells. After incubation for 3 days to allow expression of the 
apopholasin, the luciferin is added to form pholasin. Expression is checked using a 
polyclonal antibody to pholasin raised in rabbits. Addition of oxygen metabolites outside 
the cell allows the permeability of the plasma membrane to oxygen metabolites to be 
assessed. As the oxygen metabolites permeate into the cytosol, the light emission 
increases. 
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EXAMPLE 10: Expression of BOIPs in plants 

c or genomic DNA coding for apopholasin is inserted into a plasmid with the cauliflower 
mosaic virus promoter and transformed into Agrobacterium tumifwans. These are then 
added to a tobacco leaf, seedlings generated, and those expressing apopholasin selected. 
The plants are grown to seed, and seedlings grown from this seed. Addition of luciferin 
forms the pholasin as described in Example 3. Stressing the plant, e.g. with wind, touch, 
cold, or peroxide, or during growth and development or by a hormone, generates light, 
showing the formation of oxygen metabolites within the live plant. A cell-specific 
promoter engineered on to the apopholasin cDNA before making the transgenic plant 
enables oxygen metabolites to be detected in specific cells within the whole, living plant. 

EXAMPLE 11: Detection of oxidative damage in vitro 

Addition of pholasin to serum or plasma from a rat, mouse or human enables oxygen 
metabolites to be detected and measured on addition of a drug or other substance of 
interest. 

EXAMPLE 12: Detection of ROMs in a heart cells 

Reperfusion has been proposed to lead to oxygen metabolite damage in cardiac myocytes. 
Pholasin allows this to be tested for the first time. Plasmid containing apopholasin cDNA 
and the CMV promoter is transfected into isolated cardiac myocytes in culture. 
Expression occurs within 1-3 days, and pholasin is formed by addition of the luciferin as 
described in Example 3. Subjecting the cells to hypoxia followed by readmission of 
normal oxygen leads to light emission, showing that oxygen metabolites have been 
generated inside the cells. By using an imaging camera, the digital or analogue nature of 
this can be assessed as the number of cells emitting light can be visualised and counted. 



26 

EXAMPLE 13: Detection of ROMs in the nucleus and endoplasmic reticulum (ER) 



Plasmid-containing apopholasin cDNA with either nucleoplasm^ DNA or calreticulin 
DNA (with or without KDEL on the C-terminus) linked to the pholasin DNA, to target 
the apopholasin to the nucleus or ER respectively, and the CMV promoter for expression, 
is transfected into HeLa cells in culture. Expression occurs within 1-3 days, and pholasin 
is formed by addition of the luciferin as described in Example 3. Addition of oxygen 
metabolites outside the cells, or hypoxic/oxygen shock generates light measured in a 
luminometer, showing how fast oxygen metabolites penetrate into these organelles. By 
imaging with a photon counting imaging camera, the number of cells permeable to 
oxygen metabolites can be counted. Location of the pholasin can be assessed by imaging 
live cells, or by using immunofluorescence with the pholasin antibody on partially-fixed 
cells or GFP-pholasin in live cells. Using a rainbow protein, two or more analytes can be 
detected together. 

EXAMPLE 14: Use of pholasin as a protein label 

Pholasin can be used as a label in homogeneous or heterogeneous immunoassay. 
Apopholasin is first covalently linked to an antibody to HIV, and pholasin formed by 
addition of luciferin as described in Example 3. The antibody is then used in a standard 
chemiluminometric immunoassay format. Addition of HIV antigen leads to an increase 
in antibody binding and an increase in light emission dependent on the amount of HTV 
added. The amount of HIV in a blood sample can be assessed by relating the pholasin 
light emission in the sample to the standard curve. 

EXAMPLE 15: Pholasin as a DNA label 

Apopholasin is covalently linked to an oligonucleotide probe for detecting the presence of 
the cystic fibrosis gene. Addition of the probe to DNA in a standard Southern blot allows 
the probe to bind when the gene is present. Addition of luciferin as described in Example 
3 allows the pholasin to form. Addition of hypochlorite (lOmM) in barbitone buffer pH 9 
causes the pholasin to flash and the gene can be visualised by the photon counting 
imaging camera. * 
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EXAMPLE 16: Pholasin in a two hybrid system 

Protein-protein interaction can be detected by engineering apopholasin to one half of a 
5 two hybrid system and GFP to the other. Binding will allow the yeast to grow. 

EXAMPLE 17: Pholasin in genetic entertainment 

Pholasin is able to chemiluminesce at a wide range of pH (3-10), including acid pH such 
10 as 3-4. Thus it can be added to drinks such as beer, cola, soft drinks, and spirits to make 

them glow. It can also make food glow by adding to them to the ingredients of cakes, 
% icing, popcorn; by painting the pholasin or apopholasin on to the food, or by making it 

fi genetically in the source of the food. It can be used in a wide range of toys and other 

entertaining devices including squirt guns, greeting cards, pens. 

Lp 15 

1 ^ The rainbow proteins can also be used as an alternative to pholasin alone, resulting in a 

tf rainbow of colours and colour changes. 

y % 

f3 EXAMPLE 18: Pholasin in trangenic animals 

K 20 

Transgenic animals such as nematodes, mice or plants can be generated from apopholasin 
cDNA by standard techniques. Injecting the luciferin or incubating whole plant in it forms 
the active pholasin. Oxygen or its metabolites can then be detected, measured and imaged, 
in an intact organ, or from the whole organism. It can also be used in humans, in DNA 
25 therapy or diagnosis. 

EXAMPLE 19: Apoprotein from the luminous squid Ommastrophes 

The use of apoprotein from the luminous squid Ommastrophes is substituted for 
30 apopholasin, and the methods of Examples 1 to 18, above, are carried out. 

EXAMPLE 20: Apoprotein from the mollusc Rocellaria 
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The use of the apoprotein from the mollusc Rocellaria is substituted for apopholasin, and 
the methods of Examples 1 to 18, above, are carried out. 



EXAMPLE 21: Earthworm luciferase 

The use of earthworm luciferase as a BOIP is substituted for apopholasin, and the 
methods of Examples 1 to 18, above, are carried out. 



EXAMPLE 22 



Genomic DNA from Pholas, Rocellaria, Ommastrophes, or earthworm is substituted for 
the recombinant protein in Examples 1 to 18, above, the methods of which are carried out 
in an analogous manner. 
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1 . An isolated, purified or recombinant nucleic acid sequence comprising: 

(a) a sequence that encodes the apophotoprotein of pholasin (alternatively, 
5 'apopholasin'); 

(b) a sequence substantially homologous to or that hybridises to sequence (a) under 
stringent conditions; or 

(c) a sequence substantially homologous to or that hybridises under stringent 
conditions to the sequence (a) or (b) but for the degeneracy of the genetic code; or 

1 0 (d) an oligonucleotide specific for any of the sequences (a), (b) or (c) 

PROVIDED THAT such homologous sequences according to (b) or (c) encode a protein 
capable of binding to luciferin. 

2. A sequence according to claim 1, wherein the sequence that encodes for 
apopholasin is as shown in Figure 4B. 

15 3. A sequence according to claim 1, wherein the: sequence that encodes for 
apopholasin is as shown in any one of Figuresl, 2 f 3, 4A, 6 or 9. 

4. A sequence according to any preceding claim, wherein the apopholasin is ncn- 
glycosylated. 

5. A sequence according to any preceding claim, wherein the apopholasin is 
20 glycosylated. 

6. An isolated, purified or recombinant construct incorporating a sequence encoding 
an apophotqprotein whose expression in a substrate, in association with a luciferin 
therefor> signals the presence of oxygen or an oxygen metabolite in the substrate. 

7. An isolated, purified or recombinant construct incorporating a sequence encoding 
25 an apophotoprotein whose expression in a substrate* in association with a luciferin 

therefor, signals the presence of oxygen or an oxygen metabolite in the absence of a 
corresponding luciferase in the substrate. 

8. A construct according to claim 6 or claim 7, wherein the apophotoprotein is 
apopholasin, 

30 9. A recombinant construct according to any one of claims 1 to 8, wherein the 
nucleic acid sequence is linked operably with nucleotides enabling expression and 
secretion of the apopholasin in a cellular host, 
1 0. DNA or RNA according to any of claims 1 to 9. 
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4 11. Ati isolated, purified or recombinant polypeptide comprising apophotoprot ^- Uo^oooo • 

^ pholasin (apopholasin), or & mutant or variant thereof; which mutant or variant is capable 

of binding to luciferin. 

12. An issolated> purified or recombinant polypeptide according to claim 11 
5 comprising the amino acid sequence of Figure 4 or Figure 5« 

13. - The apopholasin according to claim 11 or claim 12 when expressed by 
recombinant DN A or RNA according to claim 1 0, 

14. The apopholasin according to claim 13, which is non-glycosylated. 

15. A cell, plasmid, virus or live organism having incorporated expressibly therein a 
10 sequence according to any one of claims 1 to 10, whereby it is capable of producing an 

apoprotein. 

1 6. A vector comprising a sequence according to any one of claims 1 to 10, 
O 1 7. A host cell transformed or transfected with a vector according to claim 1 6. 

18, A biolmninescent oxidative indicator protein {BOIP), comprising an 
15 apophotoprotein according to any one of claims 1 1 to 14*in association with a luciferin.. 

19. A BOIP according to claim 18, wherein the luciferin is derived from Pholas 
dactylus. 

20, A method for the preparation of a biolmninescent oxidative indicator protein 
(BOIP), which method comprises bringing an apophotoprotein into association with a 

20 luciferin therefor. 

21. A method for the detection and/or measurement of oxygen or one of its 
metabolites extracellularly, which method comprises providing a bioluminescent 
oxidative indicator protein (BOIP) exttacellularly and thereafter detecting and/or 
quantifying light emission therefrom and/or changes in colour, intensity and/or 

25 polarisation of emission(s), wherein the apophotoprotein comprises recombinant 
apopholasin. 

22. A method for the detection and/or measurement of oxygen or one of its 
metabolites in live cells (intracellularly), which method comprises providing a BOIP 
intracellular^ and thereafter detecting and/or quantifying light emission therefrom and/or 

30 changes in colour intensity and/or polarisation of emission(s) therefrom. 

23, A method according to claim 21 or 22, wherein said BOIP is selected from native 
or chemically- or genetically- modified BOIP or a 'rainbow protein' based on such a 
BOIP. 
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24. A method according to any axis of claims 21 to 23 . wherein said BOP btauusa 
a signal peptide, targetting it to a predetermined extra- or intra- cellular site. 

25. A method according to any one of claims 21 to 23, comprising incubating a test 
sample with a cell according to claim 15 or with a membrane preparation derived 

5 therefrom. 

26. A method according to any one of claims 21 to 24, wherein light emission takes 
place in the absence of a luciferase. 

27. The use of a sequence or a protein according to any one of claims 1 to 19 in the 
detection, diagnosis or measurement of oxygen or a metabolite thereof 

10 28. A diagnostic kit incorporating a sequence or protein according to any one of 
claims 1 to 19, 

29. A method for obtaining a substantially homologous source of apopholasin, which 
method comprises culturing cells having incorporated expressibly therein a 
polynucleotide encoding apopholasin as defined in any one of claims 1 to 10, and 

15 thereafter recovering the cultured cells, 

30. A method, use or kit according to any one of claims 20 to 29, substantially as 
hereinbefore described with particular reference to the Examples, 
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Clone 40: 

GAATrCGGC&CG&GTCGGAAAAGAACAAAATGGCTTGTATCGTTTTCGTT 
GCTCTTGTCGCTCTATGCTTAATGCAACCGGGTTCCGG7GAGGAAGTACA 
ATGCGCGATGAATT GGACACAAGCT AATGAATATGTGTT CAACGTGGACT 
GGATGACCATTTTCATCTACGACTATGGCGCTCAAGAGCAACTGTACGAA 
GATCGGGCTTTGGGGCTGTGTCGGATTGAACGGGCCGGCCCAGGTACCAC 
AAAAGCCGTCTGGATTAACTGGAGTAACGACACGCAGTCATGTGTAACAA 
GAAAAACAATCTTCTTCGAGGTTGGTGGAGAAATTGCCCGGCTAGTTGAC 
TACAGACCACAGGAAGACGGAACTGAGAAAACTTTTACAAGAAAATTCTC 
TAGCAAAATGCCAGGCACTTACATGCTTATGGACGTGTGCGCTACAAGGG 
ACGCTGATGATAAATGCAT CGAAGGCACAATT GT GGT GACAGTCAGGGTG 
TCCCTATATGACGAAGATAACAATGGTGTAATGGATGAAGGTAAGGTGAT 
TC CATCTGAGACAATCGAGGATGAT ATCAAGGACTGT GGGCT CTTAGACC 
AAGATGTTGAACTCGATTATACGTGGACTCAAAACGAGTGTGATCTACCA 
GACACAGTAGACGAGGCTGAAGAC&CACCGTCAGAAACTGGAGAATTCTT 
CTGGTAGATCTATCAGACTACTTTTATCAGCAGGACAACTGGTCGTTACC 
AGACACCTATAACGTGTCCTCATCAATAAT GT GTAAAACAGAAATAATC G 
ATAGAATATTGAAAATAAAATGTTAATAAACACTGGTTGAAATATGAAAA 
AAAAAAAAAAAAAACTCGAG 

Clone 3: 

GAaTTCGGCACGAGGGAAAAGAACAAAATGGCTTGTATCGTTTTCGTT 
GCTCTTGTCGCTCTATGCTTAATGCAACCGGGTTCCGGTGAGGAAGTACA 
ATGCGCGATGAATT GGACACAAGCTAATGAATATGTGTTCAACGTGGACT 
GGATGACCAXTTTCATCTACGACTATGGCGCTCAAGAGCAACTGTACGAG 
GATCGGGCTTTGGGGCTGTGTCGGATTGAACGGGCCGGCCCAGGTACCAC 
AAAAGCCGTCTGGATTAACTGGAGTAACGACACGCAGTCATGTGTAACAA 
GAAAAACAATCTTCTTCGAGGTTGGTGGAGAAATTGCCCGGCTAGTTGAC 
TACAGACCACAGGAAGACGGAACTGAGAAAACTTTTACAAGAAAATTCTC 
TAGCAAAATGCCAGGCACTTACATGCTTAT GGACGT GTGCGCTACAAGGG 
ACGCTGATGATAAATGCATCGAAGGCACAATTGTGGTGACAGTCAGGGTG 
T CC CTATAT G ACGAAGATAACAATGGTGTAATGGATGAAGGT AAGGTTAT 
TCCATCTGAGACAATCGAGGATGATATCAAGGACTGTGGGCTCTTAGACC 
AAGATGTTGAACTCGATTAT ACGT GGACTCAAAAC GAGT GT GATCTACCA 
GACACAGTAGACGAGGCT GAAGACAC ACC GTC AGAAACTGGAGAATTCTT 
CT GGTAGAT CTAT C AG ACC ACTTT T AT C A GC AGGAC AACT G GT C GT TAC C 
AGACACCTATAACGTGTCCTCATCAATAATGTGTAAAACAGAAATAATCG 
AT AGAAT ATT GAAAATAA 



Clone 5: 

CTCGGAAAAGAAOIlAAATGGCTTGTATCGTTTTCGTTGCTCTTGTCGCTCTATGCTTAATGCAACCGGG 
TT CCGGT GAG GAAGTACAAT GC GCGATGAATTGGACACAAGCTAAT GAAT ATGT GTTC AACGT GGACTG 

gatgaccattttcatctacgactatggcgctcaagagcaactgtacgaggatcgggctttggggctgtg 
tcggattgaacgggccggcccaggtaccacaaaagccgtctggattaactggagtaacgacacgcagtc 
atgtgtaacaagaaaaacaatcttcttcgaggttggtggagaaattgcccggctagttgactacagacc 
acaggaagacg gaact gagaaaacttttac aagaaaattct ctagc aaaatgccaggc actt acat gct 
tatggacgt gtgcgctacaagggacgctgatgataaat gcatcgaaggcacaattgt ggtgacagt cag 
ggt gt c cctat atg ac gaagataacaatggt gtaat gg at g aaggt aaggtt att cc at ctg agac aat 
c gaggatgat atcaaggactgtgggctcttagac caagat gtt gaact cgatt at acgtggactcaaaa 
c g agt gt gat ctac c ag acac agtagac gagg ct gaagacac ac c gtc ag aaact ggagaat t ctt ct g 
gtagatctatcagaccacttttatcagcaggacaactggtcgttaccagacacctataacgtgtcctca 
t c aataatgtgtaaaacagaaataat c gat agaat att gaaaataaaat gtt aat agac act ggtt gaa 
aaaaaaaaaaaaaaaaaacrosag 




SUBSmUTE SHEET {RULE 26) 



WO 00/28025 



2 / 15 



09/831 

PCT/CB99/03654 



clone 40 
clone 3 
clone 5 



clone 40 
clone 3 
clone 5 

clone 4 0 
clone 3 
clone 5 

clone 40 

clone 3 

clone 5 

i : l clone 4 0 

£=- clone 3 

'■■ clone 5 

T)l clone 40 

clone 3 
clone 5 



;p5 _ clone 40 

clone 3 

t2 clone 5 



clone 4 0 
clone 3 
clone 5 



clone 40 
clone 3 
clone 5 



clone 40 
clone 3 
clone 5 



clone 40 
clone 3 
clone 5 



clone 40 
clone 3 
clone 5 



GaflrrCGGCACGAGTCGGAAAAGAACAAAATGGCTTGrArCGTTTTCGTT 
GAATTCGGCACGAG — GGAAAAGAACAAAATGGCTTGTATCGTTTTCGTT 
GTCGGAAAAGAACAAAATGGCTTGTATCGTTTTCGTT 



GCTCTTGTCGCTCTATGCTTAATGCAACCGGG7TCCGGTGAGGAAGTACA 

GCTCTTGTCGCTCTATGCTTAATGCAACCGGGTTCCGGTGAGGAAGTACA 

GCTCTTGTCGCTCTATGCTTAATGCAACCGGGTrCCGGTGAGGAAGTACA 

+ + + + * + + * + + + 

ATGCGCGAT GAATTGGACACAAGCT AATGAATAT GTGTTCAACGT GGACT 
ATGCGCGATGAATTGGACACAAGCTAATGAATATGTGTTCAACGTGGACT 
ATGCGCGAT GAATTGGACACAAGCT AATGAATATGTGTTCAACGT GGACT 

GGATGACCATTTTCATCTACGACTATGGCGCTCAAGAGCAACTGTACGAA 

GGATGACCATTTTCATCTACGACTATGGCGCTCAAGAGCAACTGTACGAG 

GGAT GACCATTTT CAT CTAC GACTATGGCGCT C AAGAGCAACTGTACGAG 
+ + + + + + + + + + + + + + + + + + + + + + + * + 

GATCGGGCTTTGGGGCTGTGTCGGATTGAACGGGCCGGCCCAGGTACCAC 

GATCGGGCTTTGGGGCTGTGTCGGATTGAACGGGCCGGCCCAGGTACCAC 

GATCGGGCTTTGGGGCTGTGTCGGATTGAACGGGCCGGCCCAGGTACCAC 
+ 

AAAAGCCGT CTGGATTAACTGGAGTAACGACACGCAGTCAT GTGTAACAA 
AAAAGCCGTCTGGATTAACTGGAGTAACGACACGCAGTCATGTGTAACAA 
AAAAGCCGT CTGGATTAACTGGAGTAACGACACGCAGTCATGTGTAACAA 

GAAAAACAATCTTCTTCGAGGTTGGTGGAGAAATTGCCCGGCTAGTTGAC 

GAAAAACAATCTTCTTCGAGGTTGGTGGAGAAATTGCCCGGCTAGTTGAC 

GAAAAACAATCTTCTTCGAGGTTGGTGGAGAAATTGCCCGGCTAGTTGAC 
+ + + + + + * + + + + + + + + + + 

TACAGACCACAGGAAGACGGAACTGAGAAAACTTTTACAAGAAAATTCTC 
TACAGACCACAGGAAGACGGAACTGAGAAAACTTTTACAAGAAAATTCTC 
TACAGACCACAGGAAGACGGAACTGAGAAAACTTTTACAAGAAAATTCTC 

± + * + + +± + ±+ + + + ** + + ± + ± + + + + +±± + + + + + + ++ + ±+ + + + ir** + *+ +'k + 

TAGCAAAATGCCAGGCACTTACATGCTTATGGACGTGTGCGCTACAAGGG 
TAGCAAAATGCCAGGCACTTACATGCTTATGGACGTGTGCGCTACAAGGG 
TAGCAAAATGCCAGGCACTTACATGCTTATGGACGTGTGCGCTACAAGGG 

* + + + ★ + + + + + * *•****■*#* ++** + 

ACGCTGATGATAAATGCATCGAAGGCACAATTGTGGTGACAGTCAGGGTG 
ACGCT GAT GAT AAATGC AT CGAAGGCACAATTGTGGT GACAGTCAGGGT G 
ACGCTGATGATAAATGCATCGAAGGCACAATTGTGGTGACAGTCAGGGTG 
+ + •*••*■* + * + + * 

T CCCT AT AT GACGAAGATAACAAT GGTGTAATGGAT GAAGGTAAGGT GAT 

T CC CTATAT GACGAAGAT AACAAT GGTGTAATGGATGAAGGTAAGGTTAT 

T CCCTATAT GACGAAGATAAC AATGGT GT AATGGATGAAGGTAAGGTTAT 

+ + + * +* 

TCC AT CT GAGACAAT CGAGGATG ATAT CAAGGACT GTGGGCTCTTAGACC 
TCCATCTGAGACAATCGAGGATGATATCAAGGACTGTGGGCTCTTAGACC 

TCCATCTGAGACAAT CGAGGATGAT AT CAAGGACTGT GGGCTCTTAGACC 

★ * + # + + *** *>* **** + + + + 



Fig. 2 (Part 1 of 2) 
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clone 40 
clone 3 
clone 5 



clone 40 
clone 3 
clone 5 



clone 40 
clone 3 
clone 5 



clone 40 
clone 3 
clone 5 



clone 40 
clone 3 
clone 5 



clone 40 
clone 3 
clone 5 



AAGATGTTGAACTCGATTATACGTGGACTCAAAACGAGTGTGATCTACCA 
AAGATGTTGAACT CGATTATACGTGGACT C AAAACGAGTGTGATCTACCA 
AAGATGTTGAACT CGATTATACGTGGACT CAAAACGAGTGT GAT CTACCA 



++++++++ +++ + + + + + + + + ++++++ + + + + + + + ± + + + + +++ + + + ilril .^ it 4 rir + 



GACACAGTAGACGAGGCTGAAGACACACC GT CAGAAACTGGAGAATTCTT 
GACACAGTAGACGAGGCTGAAGACACACCGTCAGAAACTGGAGAATTCTT 
GACACAGTAGACGAGGCTGAAGACACACCGTCAGAAACTGGAGAATTCTT 



CTGGTAGATCTATCAGACTACTTTTATCAGCAGGACAACTGGTCGTTACC 
CTGGTAGATCTATCAGACCACTTTTATCAGCAGGACAACTGGTCGTTACC 
CTGGTAGATCTATCAGACCACTTTTATCAGCAGGACAACTGGTCGTTACC 



AGACACCTATAACGTGTCCTCATCAATAATGTGTAAAACAGAAATAATCG 
AGACACCTATAACGTGTCCTCATCAATAATGTGTAAAACAGAAATAATCG 
AGACACCTATAACGTGTCCTCATCAATAATGTGTAAAACAGAAATAATCG 

************* ++++++ * + **** ************************* 

ATAGAATATTGAAAATAAAAT GTTAATAAACACTGGTTGAAATAT GAAAA 
ATAGAATATTGAAAATAA 

ATAGAATATTGAAAATAAAAT GTTAAT AGACACT GGTT GAAA AAA 

****************..**. 

AAAAAAAAAAAAAACTCGAG 



AAAAAAAAAAAAAACrCGAG 
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GA^rrCGGCilCG^GTCGGAAAAGAACAAAATGGCTTGTATCGTTTTCGTTGCTCTTG 

8S 

TCGCTCTATGCTTAATGCAACCGGGTTCCGGTGAGGZ\AGTAC2\ATGCGCGATGAATT 

GGACACSAGCTAATGAATATGTGTTCAACGTGGACTGGATGACCATTTTCATCTACG 

ACTATGGCGCTCAAGAGCAACTGTACGAAGATCGGGCTTTGGGGCTGTGTCGGATTG 

3A 

AACGGGCCGGCCCAGGTACCACAAAAGCCGTCTGGATTAACTGGAGTAACGACACGC 

AGT CATGT GTAACAAGAAAAACAAT CTT CTT C GAGGTT G GT GGAGAAATTGCCCGGC 

4S 

TAGTTGACTACAGACCACAGGAAGACGGAACTGAGAAAACTTTTACAAGAAAATTCT 
CTAGCAAAAT GCCAGGCACT TACAT GCT TATGGACGTGT GC GCT ACAAGGGAC GCTG 

ATGATAAATGCATCGAAGGCACAATTOTGGTGACAGTCAGGGTGTCCCTATATGACG 
6A 

AAGATAACAAT GGT GTAAT GGAT GAAGGT AAGGT GTT CCAT CT GAGACAAT C GAGGA 

T GATATCAAGGACTGTGGGCTCTTASACC AAGAT GTT GAACTC GAT T ATAC GT GGAC 
7S 

T CAAAACGAGT GT GAT CT AC CAGAC ACAGT AGAC GAGG CT GAAGACAC AC C GT C AGA 
AACTGGAGAATTCTTCTGGTAGATCTATCAGACTACTTTTATCAGCAGGACAACTGG 
T C GTTACCAGACACCTAT AACGT GTCCT CATCAAT AAT GT GTAAAACAGAAATAATC 

GATAGAAT ATT GAAAATAAAAT GTTAATAAACACTGGTTGAAATATGAAAAAAAAAA 

5A 

AAAAAAAACTCGAG 



Fig. 3 
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Ontranslated region 
G&A2TCGGCACGAGTCGGAAAAGAACAAA 
Translated region 

ATG GCT TGT ATC GTT TTC GTT GCT CTT GTC GCT CTA TGC TTA ATG 45 
MACIVFVALVALCLM 

CAA CCG GGT TCC GGT GAG GAA GTA CAA TGC GCG ATG AAT TGG ACA 90 
QPGSGEEVQCAMNWT 

CAA GCT AAT GAA TAT GTG TTC AAC GTG GAC TGG ATG ACC ATT TTC 135 
QANEYVFNV DWMT I F 

ATC TAC GAC TAT GGC GCT CAA GAG CAA CTG TAC GAA GAT CGG GCT 180 
IYDYGAQEQLYEDRA 

TTG GGG CTG TGT CGG ATT GAA CGG GCC GGC CCA GGT ACC ACA AAA 225 
LGLCRXERAGPGTTK 

GCC GTC TGG ATT AAC TGG AGT AAC GAC ACG CAG TCA TGT GTA ACA 270 
AVWIKWSNDTQSCVT 

AGA AAA ACA ATC TTC TTC GAG GTT GGT GGA GAA ATT GCC CGG CTA 315 
RKTIFFEVGGSIA. RL 

GTT GAC TAC AGA CCA CAG GAA GAC GGA ACT GAG AAA ACT TTT ACA 350 
fi VDYRPQEDGTEKTFT 



03 

!= £ 



AGA AAA TTC TCT-AGC AAA ATG CCA GGC ACT TAC ATG CTT ATG GAC 405 
RKFSSKMPGT YMLMD 

GTG TGC GCT ACA AGG GAC GCT GAT GAT AAA TGC ATC GAA GGC ACA 450 
VCATRDADDKCIEGT 

ATT GTG GTG ACA GTC AGG GTG TCC CTA TAT GAC GAA GAT AAC AAT 4 95 

IVVTVRVS L Y DEDNN 

GGT GTA ATG GAT GAA GGT AAG GTG ATT CCA TCT GAG ACA ATC GAG 54Q 
GVMDEGKVI PSETIE 

GAT GAT ATC AAG GAC TGT GGG CTC TTA GAC CAA GAT GTT GAA CTC 585 
DDIKDCGLLDQDVEL 

GAT TAT ACG TGG ACT CAA AAC GAG TGT GAT CTA CCA GAC ACA GTA 630 
DYTWTQNECDLPDTV 

GAC GAG GCT GAA GAC ACA CCG TCA GAA ACT GGA GAA TTC TTC TGG 675 
DEAEDTPSETGEFFW 

TAG ATC TAT CAG ACT ACT TTT ATC AGC AGG ACA ACT GGT CGT TAC *720 



CAG ACA CCT ATA ACG TGT CCT CAT CAA TAA 750 
* - stop for translation 



Fig. 4 A 
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EcoR I 

GAA TrCGGCflCGACTCGGAAAAGAACAAA 

ATG GCT TGT ATC GTT TTC GTT GCT CTT GTC GCT CTA 
TGC TTA ATG CAA CCG GGT TCC GGT GAG GAA GTA CAA 
TGC GCG ATG AAT TGG ACA CAA GCT AAT GAA TAT GTG 
TTC AAC GTG GAC TGG ATG ACC ATT TTC ATC TAC GAC 
TAT GGC GCT CAA GAG CAA CTG TAC GAA GAT CGG GCT 
TTG GGG CTG TGT CGG ATT GAA CGG GCC GGC CCA GGT 
ACC ACA AAA GCC GTC TGG ATT AAC TGG AGT AAC GAC 
ACG CAG TCA TGT GTA ACA AGA AAA ACA ATC TTC TTC 
GAG GTT GGT GGA GAA ATT GCC CGG CTA GTT GAC TAC 
AGA CCA CAG GAA GAC GGA ACT GAG AAA ACT TTT ACA 
AGA AAA TTC TCT AGC AAA ATG CCA GGC ACT TAC ATG 
CTT ATG GAC GTG TGC GCT ACA AGG GAC GCT GAT GAT 
AAA TGC ATC GAA GGC ACA ATT GTG GTG ACA GTC AGG 
GTG TCC 'CTA TAT GAC GAA GAT AAC AAT GGT GTA ATG 
GAT GAA GGT AAG GTG ATT CCA TCT GAG ACA ATC GAG 
GAT GAT ATC AAG GAC TGT GGG CTC TTA GAC CAA GAT 
GTT GAA CTC GAT TAT ACG TGG ACT CAA AAC GAG TGT 
GAT CTA CCA GAC ACA GTA GAC GAG GCT GAA GAC ACA 
CCG TCA GAA ACT GGA GAA TTC TTC TGG TAG 
ATCTATCAGACTACTTTOA 

ACCTATAACGTGTCCTCATCAATAATGTGTA2^AACAfiA^TAATCGA 
TAGAATATTGAAAATAAAATGTTAATAAAC^VCTGGTTGAAA 
AAAAAAAAAAAAAAAAC TCGAG 



Xho I 




4B 



SUBSTITUTE SHEET (RULE 26) 



09/831142 



WO 00/28025 7 / 15 PCT/GB99/03654 



EEVQCAMNWTQANEYVFNVDWMTIFIYDYGA^ 

WINWSNDTQSCVTRKTIFFEVGGEIARLVDYRPQEDGTEKTETRKFSSKMPGTYMLM 
DVCATRDADDKCIEGTIVVTVRVSLYDEDNNGVMDEGKVIPSETIEDDIKDCGLLDQ 
DVELDYTWTQNECDLPDTVDEAEDTPSETGEFFW 

Fig. 5 A 



MACIVFVALVALCIiMQPGSGEEVQC^^ 

DKALGLCRIERAGPGTTKAWINWSNDTQSCVTRfCTIFFEVGGEIARLVDYRPQEDG 
TEKT FTRKFSSKMPGTYMLMDVCATRDADDKC IEGT IWTVRVS L YDEDNNGVMDEG 
KVIPSETIEDDIKDCGLLDQDVELDYTWTQNECDLPDTVDEAEDTPSETGEFFW 



Fig. 5B 
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clone 40 

BioXAct 

zTth 



GAftrrCGGCACGAGT CGGAAAAGAACAAAATGGCTTGTATCGTTTT CGTT 

TGGCTTGTATCGTTTTCGTT 



clone 40 

BioXAct 

zTth 



clone 40 

BioXAct 

rTth 



GCTCTTGTCGCTCTATGCTTAATGCAACCGGGTTCCGGTGAGGAAGTACA 

GCTCTTGTCGCTCTATGCTTAATGCAACCGGGTTCCGGTGAGGAAGTACA 

TATGCTTAATGCAACCGGGTTCCGGTGAGGAAGTACA 
************************* ************* 

ATGCGCGAT GAATT GGACACAAGCTAATGAATATGT GTTCAACGT GGACT 

ATGCGCGAT GAATT GGACACAAGCTAATGAATATGTGTTCAACGT GGACT 

ATGCGCGATGAATTGGACACAAGCTAATGAATATGTGTTCAACGT GGACT 
************************************************** 



clone 40 

BioXAct 

zTth 



GGATGACCATTTTCATCTACGACTATGGCGCTCAAGAGCAACTGTACGAA 

GGATG ACCATTTTCATCTACGACTATGGCGCT CAAGAGC AACT GTACGAA 

G GATGAC CATTTT CATCTACG ACTATGGC GCT CAAGAGCAACTGTACGAA 
* ********************* *************************** * 



clone 40 

BioXAct 

zTth 



GATCGGGCTTTGGGGCTGTGTCGGATTGAACGGGCCGGCCCAGGTACCAC 

GATCGGGCTTTGGGGCTGTGTCGGATTGAACGGGCCGGCCCAGGTACCAC 

GATCGGGCTTTGGGGCTGTGTCGGATTGAACGGGCCGGCCCAGGTACCAC 
*****^*** ********.************ ******* **** ********** 



Li I 



clone 40 
BioXAct 



AAAAGCCGTCTGGATTAACTGGAGTAACGACACGCAGTCATGTGTAACAA 
AAAAGCCGTCTGGATTAACTGGAGTAACGACACGCAGTCATGTGTAACAA 
AAAAG CC GT CTGGATTAACT G GAGTAACG AC AC G CAGTCAT GT GT AACAA 
** * * * ** * ***** 4- * **** * * # * * * * * * * ** + * * ********* * ***** * 



clone 40 

BioXAct 

zTth 



GAAAAACAATCTT CTTC GAGGTTGGTGGAGAAATTGCCCGGCTAGTTGAC 

GAAAAACAATCTT CTTC GAGGTTGGTGGAGAAATTGCCCGGCTAGTTGAC 

GAAAAACAATCTTCTTC GAGGTT GGT GGAGAAATTGC CCGGCTAGTTGAC 
**.**•*************************.*****************.**** 



clone 40 

BioXAct 

zTth 



TACAGACCACAGGAAGACGGAACTGAGAAAACTTTTACAAGAAAATTCTC 
TACAGAC CACAGG AAGACG GAACTG AGAAAACTTTTACAAG AAAATTCT C 
TACAGACCACAGGAAGACGGAACTGAGAAAACTTTTACAAGAAAATTCTC 

************************* ************************* 



clone 40 
BioXAct 



clone 40 

BioXAct 

rrtii 



TAGCAAAATGCCAGGCACTTACATGCTTATGGACGTGTGCGCTACAAGGG 
TAGCAAAATGCCAGGCACTTACATGCTTATGGACGTGTGCGCTACAAGGG 
T AGCAAAAT GCC AGGCACT TACATG CTTAT GGACGTGT GCGCTAC AAGG G 
*************************************************** 

AC GCT GAT GATAAATGCAT C GAAGGC ACAATTGTGGTG ACAGTCAG GGTG 

ACGCTGATGATAAATGCATCGAAGGCACAATTGTGGTGACAGTCAGGGTG 

ACGCTGATGATAAATGCATCGAAGGCACAATTGTGGTGACAGTCAGGGTG 
************************************************** 



clone 40 

BioXAct 

zTth 



TCCCT ATAT GACGAAGAT AACAATGGTGTAATG GATGAAGGTAAGGTGAT 

T CC CT ATAT GAC GAAGAT AAC AATG GTGT AATG GATGAAGGTAAGGTGAT 

TCCCTATATGACGAAGATAACAATGGTGTAATGGATGAAGGTAAGGTGAT 
*************************************************** 



clone 40 
BioXAct 



TCCATCTGAGACAATC GAGGATGATATCAAGGACT GTGGGCTCTTAGACC 
TCCATCTGAGACAATCGAGGATGATATCAAGGACTGTGGGCTCTTAGACC 



Fig. 6 (Parti of 2) 
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rTth 



clone 40 

BioXAct 

rTth 



clone 40 

BioXAct 

rTth 



clone 40 

BioXAct 

rTth 



clone 40 

BioXAct 

rTth 



clone 40 
BioXAct 



clone 40 

BioXAct 

rTth 



T CCAT CT GAGACAAT CGAGGATGAT AT CAAGGACT GT GG GCT CTT AGAC C 



AAGAT GTT GAACTCGATTAT ACGTGGACT CAAAACG AGTGTGAT CTACCA 

AAGAT GTTGAACT CGATTATACGTGGACTCAAAACGAGTGTGATCTACCA 

AAGATGTTGAACTCGATTATACGTGGACTCAAAACGAGTGTGATCTACCA 
***************** *********+*********************** 

GACACAGTAGACGAGGCTGAAGACACACCGTCAGAAACTGGAGAATTCTT 
GACACAGTAGACGAGGCTGAAGAGACACCGTCAGAAACTGGAGAATTCTT 
GACACAGTAGACG AGGCT GAAGACACACC GTCAGAAACT GG AGAATTCTT 
*********** *************************************** 



CTGGTAGATCTAT C AGACTACTTTT AT CAGCAGGACAACTGGTCGTTAC C 

CTGGTAGAT CT AT C AGACTACTTTT AT C AGC AGGAC AACT G GT CGTTAC C 

CTGGTAKATCTAT CAGACTACTTTT ATCAGCAGGACAACTGGTCGTTAC C 
****** ************** ***************************** 

AGAC ACCTATAAC GTGTCCTCATCAATAAT GTGTAAAACAGAAATAATCG 

AGACAC CTATAACGT GT C CTC AT CAATAAT GT GT AAAACAGAAAT AATC G 

AG ACAC CT AT AACGT GT C CT C ATCAAT AAT GT GT AAAAC 
*************************************** 

ATAGAATATTGAAAATAAAAT GTT AATAAACACTGGTTGAAATATGAAAA 
ATAGAATATTGAAAATAAAAT GTTAATAAACACTGGTTGAAATATGAA 



AAAAAAAAAAAAAA CTCGAG 




6 
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Oligo 1 

ACI ATH TTY TTY CAR GT 
Oligo 2 

CAR GAR GAR GGN ACI GA 
Oligo 2 A 

TCI GTN CCY TCY TCY TG 
Oligo N 

TTY AAY GTI GAY TGG ATG 

M=A/C R-A/G W-A/T S=G/C Y=C/T 

K=G/T 

V-A/C/G H=A/C/T D—A/G/T B=C/G/T N-A/C/G/T 
I=inosine 



Oligo 3A 

ACA CAG CCC CAA AGC CCG AT 
Oligo 4S 

-TTG CCC GGC TAG TTG ACT AC 
Oligo 5A 

CAT ATT TCA ACC AGT GTT TAT TA 
Oligo 6A 

CAA TTG TGC CTT CGA TGC A 
Oligo 7S 

GGA CTG TGG GCT CTT AG 
Oligo 8S 

ATG GCT TGT ATC GTT TTC GT 
Oligo T7 
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Oligo ExS 

CCA CAC GGA TCC TGA GGA AGT ACA ATG 
Oligo ExA 

CCA CAC GGA TCC TTA TTG ATG AGG ACA 
Oligo Bad 

CTT GTT TTT ATG GTC GTC TAG ATT TCT TAC ATC TAT GCG GAG 
GAA GTA CAA TG 

Oligo C9 12 

CCA CAC AGA TCT AGA ATG AAA TTC TTA GTC AAC GTT GCC CTT 
GTT TTT ATG GTC 

Oligo BV5 

TTT ACT GTT' TTC GTA ACA GTT TTG 
Oligo BV3 

CAA CAA CGC ACA GAA TCT AG 
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AccI 630 

AflXXX 405 734 

Alul 95 

AlwNI 659 

Asp 718 215 

Asul 204 209 

BanX 215 

Banll 564 

BcnX 51 310 

BglXI 678 

Bspl 286 564 

BstNX 213 384 

BsttTC 77 

CfrlOI 206 

Cfrl3l 204 209 

Dctel 345 528 565 

DP»I 174 615 680 

EcoRX 665 

EcoRII 211 382 

EcoRV 547 

Fokl 136 518 554 

HaeXX 153 
HaeXIX 206 210 • 

HgaX 431 

Hhal 77 152 413 

HincIX 319 

HxnfX 520 598 

HinPlI 75 150 411 

HpaXX 50 57 207 310 

HphX 71 469 529 

Kpnl 219 

MaeX 314 372 

MaeXX H4 405 593 734 

MaelXX 245 265 457 716 

MboXX 182 274 277 347 

497 653 661 

54 282 531 627 

750 

Msel 41 237 

KaeX 208 

Neil 51 310 

NlaXXX 264 397 

NlaXV 55 217 

Nsil 440 
NspHI 397 
PleX 592 

*saX 69 167 217 

Sau3AX 172 613 678 

Sau96X 204 209 

ScrFX 51 213 310 384 

SfaNX 428 T7ir* i 

TaqX 288 441 537 585 Jl ll£m C 

XhoXI 678 " 
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NlalV 
Mull 
ScxFI 
Neil 
Bcnl 

Msel Kpall Hpall 

ATGGCTTGTATCGTTTTCGTTGCTCTTGTCGCTCTATGCTTAArGCAACCGGGTTCCGOT 60 

Hhal 
HphI BstUI 

Rsal HinPlI Alul Maell 

Mil I | 

G AGGAAGTACAAT GC GCGAT G AATT GGAC ACAAGC T AAT GAAT AT GT G T TC AAC G T GGAC 120 

Haell 

Hhal Dpnl 
FoJcI HinPlI Rsal Sau3AI 

1 III ill 

TGGAT GACC ATT T TC ATC T ACGACT AT G GC GCTCAAGAGCAACT GTAC GAAGAT C GGGCT 180 

Haelll 
Sau96I 
Cfrl3I 
Nael Rsal 
Hpall NlalV 
Haelll BanI 
CfrlOT ScrFI 
Sau96I BstNI 
Cfrl3I EcoRII Kpnl 
Mbo11 Asul Asul Asp718 Msel 

1 i Mill! I I I I ] 

TTGGGGCTGTGTCGGATTGAACGGGCCGGCCCAGGTACCACAAAAGCCGTCTGGATTAAC 240 

Haelll MboII 
Maelll Nlalll MboII Mnll TaqI 

1 n 1 I f i 

TGGAGTAAC GACACGCAGTCATGTGTAACAAGAAAAACAATCTTCTTCGAGGTT GGT GGA 300 

Mael 
ScrFI 
Neil 

Hpall MboII 
Bcnl Hindi odel 

II! i| 
GAAATT GCCC GGCTAGTT GACT ACAGAC CACAGGAAGAC GGAAC TGAGAAAACT TT T ACA 3 60 



Fig. 9 (Part 1 of 2) 
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ScrFI 

BstNI NspHI Maell HinPlI 

Mael EcoRII NlalXI Afllll Hhal 

1 I I I I I [ 

AGaAAATTCTCTAGCAAAATGCCAGGCACTTACATGCTTATGGACGTGTGCGCTACAAGG 420 

Hgal TaqI 
SfaNI Nsil Maelll HphI 

III! I ! 

GAC GCT GAT GAT AAAT GC ATCG AAGGCACAAT T GT GGTGAC AGT C AGG GT GTCCCT ATAT 480 

Mull 
Hinfl KphI 
MboII FoicI Ddel TaqI 

1 1 I II 1 1 ! 

GACGAAGATAACAATGGTGTAATGGATGAAGGTAAGGTGATTCCATCTGAGACAATCGAG 5 40 

Ddel 

Bspl286 Maell 
EcoRV Fofcl Banll TaqI Plel Hinfl 

ill! [ II | 

GATGATATCAAGGAC TGT GGGCTCTTAGACCAAGAT GT TGAACTC GATT ATAC GTGGACT 60 0 

Dpnl AccI 
Sau3AI Mnll MboII AlwNX 

CAAAAC GAGT GT GAT CTACC AGACACAGT AGAC GAGGC T GAAGAC ACAC C GTCAGAAACT 660 

Dpnl 
XhoII 
EcoRI Sau3AI 
MboII Bglll Maelll 

fill | 
QG AGAATT C TT C T GGTAGATC TATC AGAC TAC T T T TAT CAGC AGGACAACT GGT C GT TAG 720 

Maell 

Afllll Mnll 

I I 
CAGACACCTATAACGTGTCCTCATCAATAA 750 



Fig. 9 (Part 2 of 2) 
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Ref: WCM.69.USA 

COMBINED DECLARATION AND POWER OF ATTORNEY 

As a below named inventor, I hereby declare that 

My residence, post office address and citizenship are as stated below next to my name. 

I believe 1 am the original, first and sole inventor (if only one name is listed below) or an original, first and joint 
inventor (if plural names are listed below) of the subject matter which is claimed and for which a patent is 
sought on the invention entitled: 

the specification of which: (check one) 

REGULAR OR DESIGN APPLICATION 

is attached hereto- 



[ 1 

[ 1 



was filed on 



as application Serial No. 



and was amended on 



(if 



applicable). 

PCT FILED APPLICATION ENTERING NATIONAL STAGE 



M 



[X] 



was described and claimed in International application No. 
PCT/GB99/03654 filed on 5 m November 1 999 



1 hereby state that ! have reviewed and understand the contents of the above-identified specification, including the claims, as amended 
by any amendment referred to above, 

I acknowledge the duty to disclose information which is material to patentability as defined in Title 37> Code of Federal Regulations, 
§1-56. 

PRIORITY CLAIM 



□ 



p 



i hereby claim foreign priority benefits under 35 USC 119 of any foreign appiication(s) for patent or inventor's certificate listed below 
and have also identified below any foreign application for patent or inventor's certificate having a filing date before that of the 
application on which priority is claimed. 

PRIOR FOREIGN APPLICATIONS) 



Country 


Application 
Number 


Date of Filing 
(day, month, year) 


Priority 
Claimed 


Great Britain 


9824357.9 


7 m November 1998 


Yes 



(Complete this part only if this is a continuing application.) 

I hereby claim the benefit under 35 USC 120 of any United States application^) listed below and, insofar as the subject matter of each of the claims of 
this application is not disclosed in the prior United States application in the manner provided by the first paragraph of 35 USC 112, I acknowledge the 
duty to disclose information which is material to patentability as defined in Title 37 Code of Federal Regulations §1 .56 which became available between 
the filing date of the prior application and the national or PCT international filing date of this application: 



(Application Serial No.) 
Form YHtT (2/97) 



(Filing Date) 



(Status— patented, pending, abandoned) 
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POWER OF ATTORNEY 

The undersigned hereby authorizes the U.S. attorney or agent named herein to accept and follow instructions from 
WYNNE-JONES, LAINE & JAMES as to any action to be taken in the Patent and Trademark Office regarding this 
application without direct communication between the U.S. attorney or agent and the undersigned, in the event of a 
change in the persons from whom instructions may be taken, the U.S. attorney or agent named herein will be so notified 
by the undersigned. N 
As a named inventor, I hereby appoint the following attomey(s) to prosecute this application and 7 
transact all business in the Patent and Trademark Office connected therewith : Robert J. PATCHMV 
Reg. No. 37.355, Andrew J. PATCH, Reg. No. ^2,925, Robert F. HARGEST, Reg. No. 25,500^ 
Benoit CASTEL, Reg, No. 35^41, Eric JENSEN, Reg. No. 31355, and Thomas W. PERKINS, 
Reg. No. Z^Q27^ do ¥QUNG~£ THOMPSON, SeropdJ^ jpor, 745 South 23rd Street, Ar lington^ 
Wginiaj?22Q2^ all telephone calls to Young & Thompson at 703/521-2297. 

I hereby declare that all statements made herein of my own knowledge are true and that ail 
statements made on information and belief are believed to be true; and further that these statements 
were made with the knowledge that willful false statements and the like so made are punishable by 
fine or imprisonment, or both under Section 1001 of Title 18 of the United States Code and that such 
willful false statements may jeopardize the validity of the application or any patent issued thereon. 

Full name of sole or first inventor: Anthony Keith CAMPBELL 

(given name, family name) s\ 

Inventor's signature f^- F^ f Date JJ^J2^^/ 

Residence: PeDarthrOnited Kingdom ^ y/ Citizenship: British 

Post Office Address:14 Maillard's Haven, Penarth, Vale of Glamorgan, CF65 5RF, United 
Kingdom 
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SEQUENCE LISTING 

<110> CAMPBELL, ANTHONY KEITH 

<12 0> PROTEIN AND DNA CODING THEREFOR 

<130> WCM.69.US 

<140> 09/831,142 
<141> 2001-05-07 

<150> PCT/GB99/03654 
<151> 1999-11-05 

<150> GB 9824357.9 
<151> 1998-11-07 

<160> 42 

<170> Patentln Ver. 2.1 

<210> 1 

<211> 870 

<212> DNA 

<213> Pholas dactylus 

<220> 

<221> CDS 

<222> (30) . . (704) 

<400> 1 

gaattcggca cgagtcggaa aagaacaaa atg get tgt ate gtt ttc gtt get 

Met Ala Cys lie Val Phe Val Ala 
1 5 



ctt gtc get eta tgc tta atg caa ccg ggt tec ggt gag gaa gta caa 
Leu Val Ala Leu Cys Leu Met Gin Pro Gly Ser Gly Glu Glu Val Gin 
10 15 20 

tgc gcg atg aat tgg aca caa get aat gaa tat gtg ttc aac gtg gac 
Cys Ala Met Asn Trp Thr Gin Ala Asn Glu Tyr Val Phe Asn Val Asp 
25 30 35 40 

tgg atg acc att ttc ate tac gac tat ggc get caa gag caa ctg tac 
Trp Met Thr He Phe He Tyr Asp Tyr Gly Ala Gin Glu Gin Leu Tyr 

45 50 55 

gaa gat egg get ttg ggg ctg tgt egg att gaa egg gee ggc cca ggt 
Glu Asp Arg Ala Leu Gly Leu Cys Arg He Glu Arg Ala Gly Pro Gly 
60 65 70 

acc aca aaa gee gtc tgg att aac tgg agt aac gac acg cag tea tgt 
Thr Thr Lys Ala Val Trp He Asn Trp Ser Asn Asp Thr Gin Ser Cys 
75 80 85 



gta aca aga aaa aca ate ttc ttc gag gtt ggt gga gaa att gee egg 
Val Thr Arg Lys Thr He Phe Phe Glu Val Gly Gly Glu He Ala Arg 
90 95 100 



eta gtt gac tac aga cca cag gaa gac gga act gag aaa act ttt aca 
Leu Val Asp Tyr Arg Pro Gin Glu Asp Gly Thr Glu Lys Thr Phe Thr 
105 - HO 115 120 



tgc get aca agg gac get gat gat aaa tgc ate gaa ggc aca att gtg 
Cys Ala Thr Arg Asp Ala Asp Asp Lys Cys He Glu Gly Thr He Val 
140 145 150 



389 



aga aaa ttc tct age aaa atg cca ggc act tac atg ctt atg gac gtg 437 
Arg Lys Phe Ser Ser Lys Met Pro Gly Thr Tyr Met Leu Met Asp Val 

125 130 135 



485 



gtg aca gtc agg gtg tec eta tat gac gaa gat aac aat ggt gta atg 533 
Val Thr Val Arg Val Ser Leu Tyr Asp Glu Asp Asn Asn Gly Val Met 
155 160 165 



gat gaa ggt aag gtg att cca tct gag aca ate gag gat gat ate aag 
Asp Glu Gly Lys Val He Pro Ser Glu Thr He Glu Asp Asp He Lys 
170 175 180 



581 



gac tgt ggg etc tta gac caa gat gtt gaa etc gat tat acg tgg act 
Asp Cys Gly Leu Leu Asp Gin Asp Val Glu Leu Asp Tyr Thr Trp Thr 
185 190 195 200 



caa aac gag tgt gat eta cca gac aca gta gac gag get gaa gac aca 677 
Gin Asn Glu Cys Asp Leu Pro Asp Thr Val Asp Glu Ala Glu Asp Thr 

205 210 215 



ccg tea gaa act gga gaa ttc ttc tgg tagatctatc agactacttt 724 
Pro Ser Glu Thr Gly Glu Phe Phe Trp 
220 225 



tatcagcagg acaactggtc gttaccagac acctataacg tgtcctcatc aataatgtgt 784 

aaaacagaaa taatcgatag aatattgaaa ataaaatgtt aataaacact ggttgaaata 844 

tgaaaaaaaa aaaaaaaaaa ctcgag 8 70 

<210> 2 
<211> 816 
<212> DNA 

<213> Pholas dactylus 



<400> 2 

gaatteggea cgagggaaaa gaacaaaatg gcttgtatcg ttttcgttgc tettgtcget 60 
etatgettaa tgcaaccggg ttccggtgag gaagtacaat gcgcgatgaa ttggacacaa 12 0 
gctaatgaat atgtgttcaa cgtggactgg atgaccattt tcatctacga etatggeget 180 
caagagcaac tgtacgagga tegggctttg gggctgtgtc ggattgaacg ggccggccca 24 0 
ggtaccacaa aagccgtctg gattaactgg agtaacgaca cgcagtcatg tgtaacaaga 3 00 
aaaacaatct tcttcgaggt tggtggagaa attgcccggc tagttgacta cagaccacag 3 60 
gaagacggaa ctgagaaaac ttttacaaga aaattctcta geaaaatgee aggcacttac 42 0 
atgcttatgg acgtgtgcgc tacaagggac gctgatgata aatgeatega aggcacaatt 4 80 
gtggtgacag tcagggtgtc cctatatgac gaagataaca atggtgtaat ggatgaaggt 540 
aaggttattc catctgagac aatcgaggat gatatcaagg actgtgggct cttagaccaa 600 
gatgttgaac tcgattatac gtggactcaa aacgagtgtg atctaccaga cacagtagac 660 
gaggctgaag acacaccgtc agaaactgga gaattcttct ggtagatcta tcagaccact 72 0 
tttatcagca ggacaactgg tcgttaccag acacctataa cgtgtcctca tcaataatgt 780 



gtaaaacaga aataatcgat agaatattga aaataa 



816 



<210> 3 
<211> 852 
<212> DNA 

<213> Pholas dactylus 
<400> 3 

gtcggaaaag aacaaaatgg cttgtatcgt 
gcaaccgggt tccggtgagg aagtacaatg 
tgtgttcaac gtggactgga tgaccatttt 
gtacgaggat cgggctttgg ggctgtgtcg 
agccgtctgg attaactgga gtaacgacac 
cttcgaggtt ggtggagaaa ttgcccggct 
tgagaaaact tttacaagaa aattctctag 
cgtgtgcgct acaagggacg ctgatgataa 
cagggtgtcc ctatatgacg aagataacaa 
atctgagaca atcgaggatg atatcaagga 
cgattatacg tggactcaaa acgagtgtga 
cacaccgtca gaaactggag aattcttctg 
gacaactggt cgttaccaga cacctataac 
ataatcgata gaatattgaa aataaaatgt 
aaaaaactcg ag 



tttcgttgct cttgtcgctc tatgcttaat 60 
cgcgatgaat tggacacaag ctaatgaata 12 0 
catctacgac tatggcgctc aagagcaact 180 
gattgaacgg gccggcccag gtaccacaaa 240 
gcagtcatgt gtaacaagaa aaacaatctt 3 00 
agttgactac agaccacagg aagacggaac 360 
caaaatgcca ggcacttaca tgcttatgga 42 0 
atgcatcgaa ggcacaattg tggtgacagt 480 
tggtgtaatg gatgaaggta aggttattcc 54 0 
ctgtgggctc ttagaccaag atgttgaact 600 
tctaccagac acagtagacg aggctgaaga 660 
gtagatctat cagaccactt ttatcagcag 72 0 
gtgtcctcat caataatgtg taaaacagaa 7 80 
taatagacac tggttgaaaa aaaaaaaaaa 84 0 

852 



<210> 4 
<211> 225 
<212> PRT 

<213> Pholas dactylus 



<400> 4 

Met Ala Cys lie 
1 

Pro Gly Ser Gly 
20 

Asn Glu Tyr Val 
35 

Tyr Gly Ala Gin 
50 

Arg lie Glu Arg 
65 

Trp Ser Asn Asp 



Glu Val Gly Gly 
100 

Asp Gly Thr Glu 
115 

Gly Thr Tyr Met 
130 



Val Phe Val Ala 
5 

Glu Glu Val Gin 



Phe Asn Val Asp 
40 

Glu Gin Leu Tyr 
55 

Ala Gly Pro Gly 
70 

Thr Gin Ser Cys 
85 

Glu lie Ala Arg 



Lys Thr Phe Thr 
120 

Leu Met Asp Val 
135 



Leu Val Ala Leu 
10 

Cys Ala Met Asn 
25 

Trp Met Thr lie 



Glu Asp Arg Ala 
60 

Thr Thr Lys Ala 
75 

Val Thr Arg Lys 
90 

Leu Val Asp Tyr 
105 

Arg Lys Phe Ser 



Cys Ala Thr Arg 
140 



Cys Leu Met Gin 
15 

Trp Thr Gin Ala 
30 

Phe lie Tyr Asp 
45 

Leu Gly Leu Cys 



Val Trp lie Asn 
80 

Thr lie Phe Phe 
95 

Arg Pro Gin Glu 
110 

Ser Lys Met Pro 
125 

Asp Ala Asp Asp 



Lys Cys lie 
145 



Glu Gly Thr He Val Val Thr Val Arg Val Ser Leu Tyr 
150 155 160 



Asp Glu Asp Asn Asn Gly Val Met Asp Glu Gly Lys Val He Pro Ser 

165 170 175 

Glu Thr He Glu Asp Asp He Lys Asp Cys Gly Leu Leu Asp Gin Asp 
180 185 190 

Val Glu Leu Asp Tyr Thr Trp Thr Gin Asn Glu Cys Asp Leu Pro Asp 
195 200 205 

Thr Val Asp Glu Ala Glu Asp Thr Pro Ser Glu Thr Gly Glu Phe Phe 
210 215 220 

Trp 
225 



<210> 5 
<211> 205 
<212> PRT 

<213> Pholas dactylus 
<400> 5 

Glu Glu Val Gin Cys Ala Met Asn Trp Thr Gin Ala Asn Glu Tyr Val 
15 10 15 

Phe Asn Val Asp Trp Met Thr He Phe He Tyr Asp Tyr Gly Ala Gin 
20 25 30 

Glu Gin Leu Tyr Glu Asp Arg Ala Leu Gly Leu Cys Arg He Glu Arg 
35 40 45 

Ala Gly Pro Gly Thr Thr Lys Ala Val Trp He Asn Trp Ser Asn Asp 
50 55 60 

Thr Gin Ser Cys Val Thr Arg Lys Thr He Phe Phe Glu Val Gly Gly 
65 70 75 80 

Glu He Ala Arg Leu Val Asp Tyr Arg Pro Gin Glu Asp Gly Thr Glu 

85 90 95 

Lys Thr Phe Thr Arg Lys Phe Ser Ser Lys Met Pro Gly Thr Tyr Met 
100 105 HO 

Leu Met Asp Val Cys Ala Thr Arg Asp Ala Asp Asp Lys Cys He Glu 
115 120 125 

Gly Thr He Val Val Thr Val Arg Val Ser Leu Tyr Asp Glu Asp Asn 
130 135 140 

Asn Gly Val Met Asp Glu Gly Lys Val He Pro Ser Glu Thr He Glu 
145 150 155 160 

Asp Asp He Lys Asp Cys Gly Leu Leu Asp Gin Asp Val Glu Leu Asp 



165 



170 



175 



Tyr Thr Trp Thr Gin Asn Glu Cys 
180 

Ala Glu Asp Thr Pro Ser Glu Thr 
195 200 



Asp Leu Pro Asp Thr Val Asp Glu 
185 190 

Gly Glu Phe Phe Trp 

205 



<210> 6 
<211> 225 
<212> PRT 

<213> Pholas dactylus 
<400> 6 

Met Ala Cys lie Val Phe Val Ala Leu Val Ala Leu Cys Leu Met Gin 
15 10 15 

Pro Gly Ser Gly Glu Glu Val Gin Cys Ala Met Asn Trp Thr Gin Ala 
20 25 30 

Asn Glu Tyr Val Phe Asn Val Asp Trp Met Thr lie Phe lie Tyr Asp 
35 40 45 

Tyr Gly Ala Gin Glu Gin Leu Tyr Glu Asp Arg Ala Leu Gly Leu Cys 
50 55 60 

Arg lie Glu Arg Ala Gly Pro Gly Thr Thr Lys Ala Val Trp lie Asn 
65 70 75 80 

Trp Ser Asn Asp Thr Gin Ser Cys Val Thr Arg Lys Thr lie Phe Phe 

85 90 95 

Glu Val Gly Gly Glu lie Ala Arg Leu Val Asp Tyr Arg Pro Gin Glu 
100 105 110 

Asp Gly Thr Glu Lys Thr Phe Thr Arg Lys Phe Ser Ser Lys Met Pro 
115 120 125 

Gly Thr Tyr Met Leu Met Asp Val Cys Ala Thr Arg Asp Ala Asp Asp 
130 135 140 

Lys Cys lie Glu Gly Thr lie Val Val Thr Val Arg Val Ser Leu Tyr 
145 150 155 160 

Asp Glu Asp Asn Asn Gly Val Met Asp Glu Gly Lys Val lie Pro Ser 

165 170 175 

Glu Thr lie Glu Asp Asp lie Lys Asp Cys Gly Leu Leu Asp Gin Asp 
180 185 190 

Val Glu Leu Asp Tyr Thr Trp Thr Gin Asn Glu Cys Asp Leu Pro Asp 
195 200 205 

Thr Val Asp Glu Ala Glu Asp Thr Pro Ser Glu Thr Gly Glu Phe Phe 
210 215 220 



Trp 
225 



<210> 7 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<220> 

<221> modif ied__base 
<222> (3) 
<223> i 

<400> 7 

acnathttyt tycargt 



<210> 8 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<220> 

<221> modif ied_base 

<222> (12) 

<223> A, T, C or G 

<220> 

<221> modif ied_base 
<222> (15) 
<223> i 

<400> 8 

cargargarg gnacnga 



<210> 9 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<220> 

<2 21> modif ied_base 
<222> (3) 
<223> i 



<220> 

<221> modif ied_base 
<222> (6) 

<223> A, T, C or G 
<400> 9 

tcngtnccyt cytcytg 



<210> 10 
<211> 18 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<220> 

<221> modif ied_base 
<222> (9) 
<223> i 

<400> 10 

ttyaaygtng aytggatg 



<210> 11 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 11 

acacagcccc aaagcccgat 

<210> 12 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 12 

ttgcccggct agttgactac 



<210> 13 
<211> 24 
<212> DNA 

<213> Artificial sequence 



8 



<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 13 

catatttcaa ccagtgttta ttaa 24 



<210> 14 
<211> 19 
<212> DNA 

<213> Artificial sequence 
<220> 

<22 3> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 14 

caattgtgcc ttcgatgca 19 



<210> 15 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 15 

ggactgtggg ctcttag 17 



<210> 16 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 16 

atggcttgta tcgttttcgt 2 0 



<210> 17 
<211> 27 
<212> DNA 

<213> Artificial sequence 
<220> 

<22 3> Description of Artificial Sequence: Synthetic 
oligonucleotide 



9 



Is : 



<400> 17 

ccacacggat cctgaggaag tacaatg 2 7 



<210> 18 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 18 

ccacacggat ccttattgat gaggaca 2 7 



<210> 19 
p <211> 53 



<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 19 

cttgttttta tggtcgtcta catttcttac atctatgcgg aggaagtaca atg 53 



<210> 20 
<211> 54 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 20 

ccacacagat ctagaatgaa attcttagtc aacgttgccc ttgtttttat ggtc 54 



<210> 21 
<211> 24 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 21 

tttactgttt tcgtaacagt tttg 24 



<210> 22 
<211> 20 



10 



<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 22 

caacaacgca cagaatctag 20 



<210> 23 
<211> 726 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Clone 40 
amplified by rTth DNA pol XL 

<220> 

<221> modif ied_base 
<222> (644) 

<223> A, T, C, G, other or unknown 



<400> 23 

tatgcttaat gcaaccgggt tccggtgagg aagtacaatg cgcgatgaat tggacacaag 60 
ctaatgaata tgtgttcaac gtggactgga tgaccatttt catctacgac tatggcgctc 12 0 
aagagcaact gtacgaagat cgggctttgg ggctgtgtcg gattgaacgg gccggcccag 180 
gtaccacaaa agccgtctgg attaactgga gtaacgacac gcagtcatgt gtaacaagaa 240 
aaacaatctt cttcgaggtt ggtggagaaa ttgcccggct agttgactac agaccacagg 3 00 
aagacggaac tgagaaaact tttacaagaa aattctctag caaaatgcca ggcacttaca 360 
tgcttatgga cgtgtgcgct acaagggacg ctgatgataa atgcatcgaa ggcacaattg 42 0 
tggtgacagt cagggtgtcc ctatatgacg aagataacaa tggtgtaatg gatgaaggta 480 
aggtgattcc atctgagaca atcgaggatg atatcaagga ctgtgggctc ttagaccaag 54 0 
atgttgaact cgattatacg tggactcaaa acgagtgtga tctaccagac acagtagacg 600 
aggctgaaga cacaccgtca gaaactggag aattcttctg gtanatctat cagactactt 660 
ttatcagcag gacaactggt cgttaccaga cacctataac gtgtcctcat caataatgtg 72 0 
taaaac 726 



<210> 24 
<211> 34 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 24 

Asn Leu Arg Asp Glu Asp Asn Asn Leu Leu Asp Glu Asn Gly Asp Leu 
15 10 15 

Leu Pro Leu Glu Ser Leu Glu Leu Asp Gin Asp Val Glu Leu Asp Tyr 
20 25 30 



Thr Trp 



11 



<212> PRT 

<213> Cyprinus carpio 
<400> 25 

lie Met Gin Lys Gly Glu Leu Val Pro Leu Asp Thr Val Leu Asp Met 
15 10 15 

He Lys Asp Ala Met He Ala Lys Ala Asp Val Ser Lys Gly Tyr 
20 25 30 



<210> 26 
<211> 20 
<212> PRT 

<213> Synechocystis sp. 
<400> 26 

Asp Gin Val Gin Ser Leu Met Arg Phe Ser Gin Ser Lys Gin He He 
15 10 15 

Phe Asn Phe Asp 
20 



<210> 27 
<211> 14 
<212> PRT 

<213> Emericella nidulans 
<400> 27 

lie Met Cys Ser Val Asp Trp Thr Arg Arg Asn Arg Phe He 
15 10 



<210> 28 
<211> 14 
<212> PRT 

<213> Drosophila melanogaster 
<400> 28 

Pro Asp Thr Tyr Asp Glu Glu Glu Asp Thr Tyr Thr His Thr 
15 10 



<210> 29 
<211> 13 
<212> PRT 

<213> Peptococcus niger 
<400> 29 

Asp Pro He Asp Glu Ala Gly Glu Val Pro Ser Glu Thr 
15 10 



<210> 30 
<211> 25 
<212> PRT 
<213> Homo 



sapiens 



12 



<400> 30 

Asp Asp Asp Gly lie Gly Tyr Val Glu Asp Gly Arg Glu lie Phe Asp 
15 10 15 

Asp Asp Leu Glu Asp Asp Ala Leu Asp 
20 25 



<210> 31 

<211> 59 

<212> PRT 

<213> Vargula sp . 

<400> 31 

Tyr Trp Asn Thr Trp Asp Val Lys Val Ser Leu Arg Asp Val Glu Ser 
15 10 15 

Tyr Thr Glu Val Glu Lys Val Thr lie Arg Lys Gin Ser Thr Val Val 
20 25 30 

Asp Leu lie Val Asp Gly Lys Gin Val Lys Val Gly Gly Val Asp Val 
35 40 45 

Ser He Pro Tyr Ser Ser Glu Asn Thr Ser He 
50 55 



<210> 32 

<211> 62 

<212> PRT 

<213> Renilla sp. 

<400> 32 

Ala He Lys He Ala Lys Leu Ser Ala Glu Lys Ala Glu Glu Thr Arg 
15 10 15 

Gly Phe Leu Arg Val Ala Asp Gin Leu Gly Leu Ala Pro Gly Val Arg 
20 25 30 

He Ser Val Glu Glu Ala Ala Val Asn Ala Thr Asp Ser Leu Leu Lys 
35 40 45 

Met Lys Ala Glu Glu Lys Ala Met Ala Val He Gin Ser Leu 
50 55 60 



<210> 33 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Illustrative 
P-loop binding motif 



13 

<400> 33 

Ala Ala Ala Ala Gly Lys Thr 
1 5 



<210> 34 
<211> 4 
<212> PRT 

<213> Photinus pyralis 

<400> 34 
His His Gly Phe 
1 



<210> 35 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: Synthetic 
illustrative peptide 

<400> 35 

Met Leu Ser Arg Leu Ser Leu Arg Leu Leu Ser Arg Tyr Leu Leu 
15 10 15 



<210> 36 
<211> 19 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
illustrative peptide 

<400> 36 

Lys Lys Ser Ala Leu Leu Ala Leu Met Tyr Val Cys Pro Gly Lys Ala 
15 10 15 

Asp Lys Glu 



<210> 37 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
illustrative peptide 



<400> 37 

Met Leu Leu Pro Val Pro Leu Leu Leu Gly Leu Leu Gly Leu Ala Ala 
15 10 15 



14 



<210> 38 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
illustrative peptide 

<400> 38 
Lys Asp Glu Leu 
1 



<210> 39 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
illustrative peptide 

<400> 39 
His Asp Glu Leu 
1 



<210> 40 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
illustrative peptide 

<400> 40 
Lys Glu Glu Leu 
1 



<210> 41 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
illustrative peptide 

<400> 41 

Pro Lys Lys Lys Arg Lys Val 
1 5 



15 



<210> 42 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Illustrative 
N-terminal acylation motif from Tyrosine kinase 



<400> 42 

Met Gly Cys Val Cys Ser Ser Asn Pro Asp 
15 10 



